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I, L. MICHAEL FURNESS, a citizen of the United Kingdom, residing at 2 
Brookside, Exning, Newmarket, United Kingdom, declare that: 

1 . I was employed by Incyte Corporation (hereinafter "Incyte") as a Director 
of Pharmacogenomics until December 31, 2001. I am currently under contract to be a Consultant 
to Incyte Corporation. 

2. In 1984, I received a B.Sc.(Hons) in Biomolecular Science (Biophysics 

and Biochemistry) from Portsmouth Polytechnic. 

From 1985-1987 I was at the School of Pharmacy in London, United Kingdom, 
during which time I analyzed lipid methyltransferase enzymes using a variety of protein analysis 
methods, including one-dimensional (ID) and two-dimensional (2D) gel electrophoresis, HPLC, 
and a variety of enzymatic assay systems. 
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I then worked in the Protein Structure group at the National Institute for Medical 
Research until 1989, setting up core facilities for nucleic acid synthesis and sequencing, as well 
as assisting in programs on protein kinase C inhibitors. 

After a year at Perkin Elmer-Applied Biosystems as a technical specialist, I 
worked at the Imperial Cancer Research Fund between 1990-1992, on a Eureka-funded program 
collaborating with Amersham Pharmacia in the United Kingdom and CEPH (Centre d'Etude du 
Polymorphisme Humaine) in Paris, France, to develop novel nucleic acid purification and 
characterization methods. 

In 1992, 1 moved to Pfizer Central Research in the United Kingdom, where I 
stayed until 1998, initially setting up core DNA sequencing and then a DNA arraying facility for 
gene expression analysis in 1993. My work also included bioinformatics, and I was responsible 
for the support of all Pfizer neuroscience programs in the United Kingdom. This then led me 
into carrying out detailed bioinformatics and wet lab work on the sodium channels, including 
antibody generation, western and northern analyses, PCR, tissue distribution studies, and 
sequence analyses on novel sequences identified. 

In 1998, 1 moved to Incyte Genomics, Inc., to the Pharmacogenomics group to 
look at the application of genomics and proteomics to the pharmaceutical industry. In 1999, 1 
was appointed director of the LifeExpress Lead Program which used microarray and protein 
expression data to identify pharmacologically and toxicologically relevant mechanisms to assist 
in improved drug design and development. 

On December 12, 2001 1 founded Nuomics Consulting Ltd., in Exning, U.K., and 
I am currently employed as Managing Director. Nuomics Consulting Ltd. provides expert 
technical knowledge and advice to businesses around the areas of genomics, proteomics, 
pharmacogenomics, toxicogenomics and chemogenomics. 

3. I have reviewed the specification of a United States patent application that 
I understand was filed on July 5, 2001 in the names of Preeti Lai et al. and was assigned Serial 
No. 09/701,232 (hereinafter "the Lai '232 application"). Furthermore, I understand that this 
United States patent application is the National Stage of International Application No. 
PCT/US99/1 1497, filed May 25, 1999, and published in English as WO 99/61614 on December 
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2, 1999, which claims the benefit under 35 U.S.C. § 119(e) of provisional applications U.S. Ser . 
No. 60/087,104, filed May 28, 1998 (hereinafter the Lai '104 application) and U.S. Ser. No. 
60/150,701, filed December 17, 1998. The provisional applications provide support for what is 
disclosed in the instant Lai '232 application. The SEQ ID NO:5 and SEQ ID NO: 14 sequences 
recited in the Lai '232 application claims were first disclosed in the Lai '104 application and 
listed as SEQ ID NO:5 and SEQ ID NO:l 1, respectively, in the Lai '104 application. My 
remarks herein will therefore be directed to the Lai '104 patent application, and May 28, 1998, as 
the relevant date of filing. In broad overview, the Lai '104 specification pertains to certain 
nucleotide and amino acid sequences and their use in a number of applications, including gene 
and protein expression monitoring applications that are useful in connection with (a) developing 
drugs (e.g., for the treatment of cancer), and (b) monitoring the activity of drugs for purposes 
relating to evaluating their efficacy and toxicity. 

4. I understand that (a) the Lai '232 application contains claims that are 
directed to a isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 5 
(hereinafter "the SEQ ID N0:5 polypeptide"), and (b) the Patent Examiner has rejected those 
claims on the grounds that the specification of the Lai '232 application does not disclose a 
specific and substantial asserted utility or a well established utility for the claimed SEQ ID N0:5 
polypeptide. I further understand that whether or not a patent specification discloses a 
substantial, specific and credible utility for its claimed subject matter is properly determined 
from the perspective of a person skilled in the art to which the specification pertains at the time 
of the patent application was filed. In addition, I understand that a substantial, specific and 
credible utility under the patent laws must be a "real-world" utility. 

5. I have been asked (a) to consider with a view to reaching a conclusion (or 
conclusions) as to whether or not I agree with the Patent Examiner's position that the Lai '232 
application and its priority application, the Lai '104 application, do not disclose a specific and 
substantial asserted utility or a well established "real-world" utility for the claimed SEQ ID NO: 5 
polypeptide, and (b) to state and explain the bases for any conclusions I reach. I have been 
informed that, in connection with my considerations, I should determine whether or not a person 
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skilled in the art to which the Lai '104 application pertains on May 28, 1998, would have 
concluded that the Lai '104 application disclosed, for the benefit of the public, a specific 
beneficial use of the SEQ ID NO: 5 polypeptide in its then available and disclosed form. I have 
also been informed that, with respect to the "real-world" utility requirement, the Patent and 
Trademark Office instructs its Patent Examiners in Section 2107.01 of the Manual of Patent 
Examining Procedure, 8* Edition, August 2001, under the heading L Specific and Substantial 
Requirements, Research Tools): 

Many research tools such as gas chromatographs, screening assays, and nucleotide 
sequencing techniques have a clear, specific and unquestionable utility (e.g., they 
are useful in analyzing compounds). An assessment that focuses on whether an 
invention is useful only in a research setting thus does not address whether the 
specific invention is in fact "useful" in a patent sense. Instead, Office personnel 
must distinguish between inventions that have a specifically identified substantial 
utility and inventions whose asserted utility requires further research to identify or 
reasonably confirm. 

6. I have considered the matters set forth in paragraph 5 of this Declaration 
and have concluded that, contrary to the position I understand the Patent Examiner has taken, the 
specification of the Lai '104 patent application disclosed to a person skilled in the art at the time 
of its filing a number of substantial, specific and credible real-world utilities for the claimed SEQ 
ID NO:5 polypeptide. More specifically, persons skilled in the art on May 28, 1998 would have 
understood the Lai '104 application to disclose the use of the SEQ ID NO:5 polypeptide as a 
research tool in a number of gene and protein expression monitoring applications that were well- 
known at that time to be useful in connection with the development of drugs and the monitoring 
of the activity of such drugs. I explain the bases for reaching my conclusion in this regard in 
paragraphs 7-13 below. 

7. In reaching the conclusion stated in paragraph 6 of this Declaration, I 
considered (a) the specification of the Lai '104 application, and (b) a number of published 
articles and patent documents that evidence gene and protein expression monitoring techniques 
that were well-known before the May 28, 1998 filing date of the Lai '104 application. The 
published articles and patent documents I considered are: 
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(a) Anderson, N.L., Esquer-Blasco, R., Hofmann, J.-P., Anderson, 
N.G., A Two-Dimensional Gel Database of Rat Liver Proteins Useful in Gene Regulation and 
Drug Effects Studies . Electrophoresis, 12, 907-930 (1991) (hereinafter "the Anderson 1991 
article") (copy annexed at Tab A); 

(b) Anderson, N.L., Esquer-Blasco, R., Hofmann, J.-P., Mehues, L., 
Raymackers, J., Steiner, S. Witzmann, P., Anderson, N.G.. An Updated Two-Dimensional Gel 
Database of Rat Liver Proteins Useful in Gene Regulation and Drug Effect Studies . 
Electrophoresis, 16, 1977-1981 (1995) (hereinafter "the Anderson 1995 article") (copy annexed 
at Tab B); 

(c) Wilkins, M.R., Sanchez, J.-C, Gooley, A.A., Appel, R.D., 
Humphery-Smith, 1., Hochstrasser, D.F., WilUams, K.L., Progress with Proteome Projects: Whv 
all Proteins Expressed bv a Genome Should be Identified and How To Do It . Biotechnology and 
Genetic Engineering Reviews, 13, 19-50 (1995) (hereinafter "the Wilkins article") (copy annexed 
at Tab C); 

(d) Cells, I.E., Rasmussen, H.H., Leffers, H., Madsen, P., Honore, B., 
Gesser, B., Dejgaard, K., Vandekerckhove, J.. Human Cellular Protein Patterns and their Link to 
Genome DNA Sequence Data: Usefulness of Two-Dimensional Gel Electrophoresis and 
Microsequencing. FASEB Journal, 5, 2200-2208 (1991) (hereinafter "the CeUs article") (copy 
annexed at Tab D); 

(e) Franzen, B., Linder, S., Okuzawa, K., Kato, H., Auer, G., 
Nonenzvmatic Extraction of Cells from Clinical Tumor Material for Analysis of Gene 
Expression bv Two-Dimensional Poly acryl amide Gel Electrophoresis . Electrophoresis, 14, 1045- 
1053 (1993) (hereinafter "the Franzen article") (copy annexed at Tab E); 

(f) Bjellqvist, B., Basse, B., Olsen, E., Celis, J.E., Reference Points 
for Comparisons of Two-Dimensional Maps of Proteins from Different Human Cell Types 
Defined in a pH Scale Where Isoelectric Points Correlate with Polypeptide Compositions . 
Electrophoresis, 15, 529-539 (1994) (hereinafter "the Bjellqvist article") (copy annexed at 
Tab F); and 

(g) Large Scale Biology Company Info; LSB and LSP Information; 
from http://www.lsbc.com (2001) (copy annexed at Tab G). 
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8. Many of the published articles I considered (i.e., at least items (a)-(f) 
identified in paragraph 7) relate to the development of protein two-dimensional gel 
electrophoretic techniques for use in protein expression monitoring applications in drug 
development and toxicology. As I will discuss below, a person skilled in the art who read the 
Lai '104 apphcation on May 28, 1998 would have understood that application to disclose the 
SEQ ID NO:5 polypeptide to be useful for a number of gene and protein expression monitoring 
applications, e.g., in the use of two-dimensional polyacrylamide gel electrophoresis and western 
blot analysis of tissue samples in drug development and in toxicity testing. 

Furthermore, items (a)-(f) establish that protein two-dimensional polyacrylamide 
gel electrophoresis and western blot analysis were well-known and established methods routinely 
used in toxicology testing and drug development at the time of filing the Lai '104 application and 
for several years prior to May 28, 1998. As such, one of ordinary skill in the art would have 
recognized that the polypeptide of SEQ ID NO: 5 could be used in toxicology testing and drug 
development, irrespective of its biochemical activities. 

9. The SEQ ID NO:5 and SEQ ID NO: 14 sequences recited in the Lai '232 
application claims were first disclosed in the Lai '104 application and listed as SEQ ID N0:5 and 
SEQ ID NO: 1 1, respectively, in the Lai '104 application. The SEQ ID NO:5 polypeptide is 
referred to as HSCOP-5 in the Lai '232 application and as SOCP-5 in the Lai '104 application. 
Turning more specifically to the Lai '104 specification, the SEQ ID NO:5 polypeptide is shown 
at pages 46-47 under the heading "Sequence Listing." The Lai '104 specification specifically 
teaches that the "invention features substantially purified polypeptides, human SOCS proteins , 
referred to collectively as 'SOCP' and individually as 'SOCP-l', 'SOCP-2', 'SOCP-3', 'SOCP- 
4', 'SOCP-5', and 'SOCP-6' and that the "invention provides a substantially purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ED 
N0:2, SEQ ID N0:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 (SEQ ID NO:l through 6). . . 
" (Lai '104 application at page 2, lines 32-36). It further teaches that (a) the identity of the SEQ 
ID NO:5 polypeptide was determined from a uterus tissue cDNA library (UTRSNOROl) (Lai 
'104 application, Tables 1 and 4), (b) the SEQ ID NO: 5 polypeptide is the human SOCS protein 
referred to as "SOCP-5" and is encoded by SEQ ID NO:ll. (Lai '104 application at page 2, lines 
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31-36 and Table 1), and (c) northern analysis of SEQ ID NO: 11 shows its expression 
predominantly in cDNA libraries made from reproductive, cardiovascular, 
hematopoietic/immune, cancer-associated, inflammation-associated, and fetal tissues (Lai '104 
application at Table 3) and therefore "SOCP appears to play a role in cancer, immune disorders, 
and infectious diseases." (Lai '104 application at page 20, lines 22-23.) 

The Lai ' 104 application discusses a number of uses of the SEQ ED NO:5 
polypeptide in addition to its use in gene and protein expression monitoring applications. I have 
not fully evaluated these additional uses in connection with the preparation of this Declaration 
and do not express any views in this Declaration regarding whether or not the Lai '104 
specification discloses these additional uses to be substantial, specific and credible real-world 
utilities of the SEQ ID NO:5 polypeptide. Consequently, my discussion in this Declaration 
concerning the Lai '104 application focuses on the portions of the appHcation that relate to the 
use of the SEQ ID NO:5 polypeptide in gene and protein expression monitoring applications. 

10. The Lai '104 application discloses that the polynucleotide sequences 
disclosed therein, including the polynucleotides encoding the SEQ ID NO:5 polypeptide, are 
useful as probes in chip based technologies. It further teaches that the chip based technologies 
can be used "for the detection and/or quantification of nucleic acid or protein sequences." (Lai 
'104 application at page 18, lines 27-28.) 

The Lai '104 application also discloses that the SEQ ID NO:5 polypeptide is 
useful in other protein expression detection technologies. The Lai '104 application states that 
"[i]mmunological methods for detecting and measuring the expression of SOCP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS)." (Lai '104 application at page 18, lines 29-32.) 
Furthermore, the Lai '104 application discloses that "[a] variety of protocols for measuring 
SOCP, including ELISAs, RIAs, and FACS, are known in the art and provide a basis for 
diagnosing altered or abnormal levels of SOCP expression. Normal or standard values for SOCP 
expression are established by combining body fluids or cell extracts taken from normal 
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mammalian subjects, preferably human, with antibody to SOCP under conditions suitable for 
complex formation." (Lai '104 application at page 28, lines 5-9.) 

In addition, at the time of filing the Lai ' 104 application, it was well known in the 
art that gene and protein expression analyses also included two-dimensional polyacrylamide gel 
electrophoresis (2-D PAGE) technologies, which were developed during the 1980s, and as 
exemplified by the Anderson 1991 and 1995 articles (Tab A and Tab B). The Anderson 1991 
article teaches that a 2-D PAGE map has been used to connect and compare hundreds of 2-D gels 
of rat liver samples from a variety of studies including regulation of protein expression by 
various drugs and toxic agents (Tab A at page 907). The Anderson 1991 article teaches an 
empirically-determined standard curve fitted to a series of identified proteins based upon amino 
acid chain length (Tab A at page 91 1) and how that standard curve can be used in protein 
expression analysis. The Anderson 1991 article teaches that "there is a long-term need for a 
comprehensive database of liver proteins" (Tab A at page 912). 

The Wilkins article is one of a number of documents that were published prior to 
the May 28, 1998 filing date of the Lai '104 application that describes the use of the 2-D PAGE 
technology in a wide range of gene and protein expression monitoring applications, including 
monitoring and analyzing protein expression patterns in human cancer, human serum plasma 
proteins, and in rodent liver following exposure to toxins. In view of the Lai ' 104 application, 
the Wilkins article, and other related pre-May 28, 1998 publications, persons skilled in the art on 
May 28, 1998 clearly would have understood the Lai '104 application to disclose the SEQ ID 
NO:5 polypeptide to be useful in 2-D PAGE analyses for the development of new drugs and 
monitoring the activities of drugs for such purposes as evaluating their efficacy and toxicity, as 
explained more fully in paragraph 12 below. 

With specific reference to toxicity evaluations, those of skill in the art who were 
working on drug development on May 28, 1998 (and for many years prior to May 28, 1998) 
without any doubt appreciated that the toxicity (or lack of toxicity) of any proposed drug they 
were working on was one of the most important criteria to be considered and evaluated in 
connection with the development of the drug. They would have understood at that time that 
good drugs are not only potent, they are specific. This means that they have strong effects on a 
specific biological target and minimal effects on all other biological targets. Ascertaining that a 
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candidate drug affects its intended target, and identification of undesirable secondary effects (i.e., 
toxic side effects), had been for many years among the main challenges in developing new drugs. 
The ability to determine which genes are positively affected by a given drug, coupled with the 
ability to quickly and at the earliest time possible in the drug development process identify drugs 
that are likely to be toxic because of their undesirable secondary effects, have enormous value in 
improving the efficiency of the drug discovery process, and are an important and essential part of 
the development of any new drug. In fact, the desire to identify and understand toxicological 
effects using the experimental assays described above led Dr Leigh Anderson to found the Large 
Scale Biology Corporation in 1985, in order to pursue commercial development of the 2-D 
electrophoretic protein mapping technology he had developed. In addition, the company focused 
on toxicological effects on the proteome as clearly demonstrated by its goals and by its senior 
management credentials described in company documents (see Tab G at pages 1, 3, and 5). 

Accordingly, the teachings in the Lai '104 application, in particular regarding use 
of SEQ ID NO:5 in differential gene and protein expression analysis (2-D PAGE maps) and in 
the development and the monitoring of the activities of drugs, clearly include toxicity studies and 
persons skilled in the art who read the Lai '104 application on May 28, 1998 would have 
understood that to be so. 

11. As previously discussed {supra, paragraphs 7 and 8), my experience with 
protein analysis methods in the mid-1980s and the several publications annexed to this 
Declaration at Tabs A through F evidence information that was available to the public regarding 
two-dimensional polyacrylamide gel electrophoresis technology and its uses in drug discovery 
and toxicology testing before the May 28, 1998 filing date of the Lai '104 application. In 
particular the Cells article stated that "protein databases are expected to foster a variety of 

biological information.... ~ among others, drug development and testing" (See Tab D, page 

2200, second column). The Franzen article shows that 2-D PAGE maps were used to identify 
proteins in clinical tumor material (See Tab E). The Lai '104 application clearly discloses that 
expression of SOCP-5 is associated with reproductive, cardiovascular, hematopoietic/immune, 
cancer-associated, inflammation-associated, and fetal tissues (Lai '104 application at Table 3). 
The Bjellqvist article showed that a protein may be identified accurately by its positional co- 
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ordinates, namely molecular mass and isoelectric point (See Tab F). The Lai '104 application 
clearly disclosed SEQ ID NO:5 from which it would have been routine for one of skill in the art 
to predict both the molecular mass and the isoelectric point using algorithms well known in the 
art at the time of filing. 

12. A person skilled in the art on May 28, 1998, who read the Lai ' 104 
application, would understand that application to disclose the SEQ ID NO:5 polypeptide to be 
highly useful in analysis of differential expression of proteins. For example, the specification of 
the Lai '104 application would have led a person skilled in the art on May 28, 1998 who was 
using protein expression monitoring in connection with working on developing new drugs for the 
treatment of cancer, immune disorders, and infectious diseases to conclude that a 2-D PAGE map 
that used the isolated SEQ ID NO:5 polypeptide would be a highly useful tool and to request 
specifically that any 2-D PAGE map that was being used for such purposes utilize the SEQ ID 
NO: 5 polypeptide sequence. Expressed proteins are useful for 2-D PAGE analysis in toxicology 
expression studies for a variety of reasons, particularly for purposes relating to providing controls 
for the 2-D PAGE analysis, and for identifying sequence or post-translational variants of the 
expressed sequences in response to exogenous compounds. Persons skilled in the art would 
appreciate that a 2-D PAGE map that utilized the SEQ ID NO:5 polypeptide sequence would be 
a more useful tool than a 2-D PAGE map that did not utilize this protein sequence in connection 
with conducting protein expression monitoring studies on proposed (or actual) drugs for treating 
cancer, immune disorders, and infectious diseases for such purposes as evaluating their efficacy 
and toxicity. 

I discuss in more detail in items (a)-(b) below a number of reasons why a person 
skilled in the art, who read the Lai '104 specification on May 28, 1998, would have concluded 
based on that specification and the state of the art at that time, that SEQ ID NO:5 polypeptide 
would be a highly useful tool for analysis of a 2-D PAGE map for evaluating the efficacy and 
toxicity of proposed drugs for cancer, immune disorders, and infectious diseases by means of 2- 
D PAGE maps, as well as for other evaluations: 

(a) The Lai '104 specification contains a number of teachings that would 
lead persons skilled in the art on May 28, 1998 to conclude that a 2-D PAGE map that utilized 
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the isolated SEQ ID NO:5 polypeptide would be a more useful tool for protein expression 
monitoring applications relating to drugs for treating cancer, immune disorders, and infectious 
diseases than a 2-D PAGE map that did not use the SEQ ID NO:5 polypeptide sequence. Among 
other things, the Lai '104 specification teaches that (i) the identity of the SEQ ID NO:5 
polypeptide was determined from a "uterus tissue cDNA library (UTRSNOROl)," (Lai '104 
application, Tables 1 and 4) (ii) the SEQ ID NO: 5 polypeptide is the human SOCS protein 
referred to as "SOCP-5" (listed as HSCOP-5 in the Lai '232 application) (Lai '104 application at 
page 2, lines 31-36 and Table 1), and (iii) SEQ ID NO:l 1 (listed as SEQ ID NO: 14 in the Lai 
'232 application) is expressed predominantly in cDNA libraries made from reproductive, 
cardiovascular, hematopoietic/immune, cancer-associated, inflammation-associated, and fetal 
tissues (Lai '104 application at Table 3) and therefore "SOCP appears to play a role in cancer, 
immune disorders, and infectious diseases." (Lai '104 application at page 20, lines 22-23; see 
paragraph 9, supra). The isolated polypeptide could therefore be used as a control to more 
accurately gauge the expression of SOCP-5 (listed as HSCOP-5 in the Lai '232 application) in 
the sample and consequently more accurately gauge the affect of a toxicant on expression of the 
gene. 

(b) Persons skilled in the art on May 28, 1998 would have appreciated (i) 
that the protein expression monitoring results obtained using a 2-D PAGE map that utilized a 
SEQ ID NO: 5 polypeptide would vary, depending on the particular drug being evaluated, and (ii) 
that such varying results would occur both with respect to the results obtained from the SEQ ID 
NO:5 polypeptide and from the 2-D PAGE map as a whole (including all its other individual 
proteins). These kinds of varying results, depending on the identity of the drug being tested, in 
no way detracts from my conclusion that persons skilled in the art on May 28, 1998, having read 
the Lai '104 specification, would specifically request that any 2-D PAGE map that was being 
used for conducting protein expression monitoring studies on drugs for treating cancer, immune 
disorders, and infectious diseases {e.g., a toxicology study or any efficacy study of the type that 
typically takes place in connection with the development of a drug) utilize the SEQ ID NO:5 
polypeptide sequence. Persons skilled in the art on May 28, 1998 would have wanted their 2-D 
PAGE map to utilize the SEQ ID NO:5 polypeptide sequence because a 2-D PAGE map that 
utilized protein sequence information the polypeptide (as compared to one that did not) would 



111188 



11 



09/701,232 



Docket No.: PF-0525 USN 

provide more useful results in the kind of protein expression monitoring studies using 2-D PAGE 
maps that persons skilled in the art have been doing since well prior to May 28, 1998. 

The foregoing is not intended to be an all-inclusive explanation of all my reasons 
for reaching the conclusions stated in this paragraph 12, and in paragraph 6, supra. In my view, 
however, it provides more than sufficient reasons to justify my conclusions stated in paragraph 6 
of this Declaration regarding the Lai ' 104 application disclosing to persons skilled in the art at 
the time of its filing substantial, specific and credible real-world utilities for the SEQ ID N0:5 
polypeptide. 

13. Also pertinent to my considerations underlying this Declaration is the fact 
that the Lai '104 disclosure regarding the uses of the SEQ ID NO:5 polypeptide for protein 
expression monitoring applications is not limited to the use of that protein in 2-D PAGE maps. 
For one thing, the Lai '104 disclosure regarding the technique used in gene and protein 
expression monitoring applications is broad. (Lai '104 application at, e.g., page 18, lines 24-28.) 

In addition, the Lai '104 specification repeatedly teaches that the protein described 
therein (including the SEQ ID NO:5 polypeptide) may desirably be used in any of a number of 
long established "standard" techniques, such as ELISA or western blot analysis, for conducting 
protein expression monitoring studies. See, e.g.: 

(a) Lai '104 application at page 18, lines 29-32 ("Immunological methods 
for detecting and measuring the expression of SOCP using either specific polyclonal or 
monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated cell sorting (FAGS)."); 

(b) Lai '104 application at page 28, lines 5-12 ("A variety of protocols for 
measuring SOCP, including ELISAs, RIAs, and FACS, are known in the art and provide a basis 
for diagnosing altered or abnormal levels of SOCP expression. Normal or standard values for 
SOCP expression are established by combining body fluids or cell extracts taken from normal 
mammalian subjects, preferably human, with antibody to SOCP under conditions suitable for 
complex formation The amount of standard complex formation may be quantitated by various 
methods, preferably by photometric means. Quantities of SOCP expressed in subject, control. 
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and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease."). 

Thus a person skilled in the art on May 28, 1998, who read the Lai '104 
specification, would have routinely and readily appreciated that the SEQ ID NO: 5 polypeptide 
disclosed therein would be useful to conduct protein expression monitoring analyses using 2-D 
PAGE mapping or western blot analysis or any of the other traditional membrane-based protein 
expression monitoring techniques that were known and in common use many years prior to the 
filing of the Lai '104 application. For example, a person skilled in the art on May 28, 1998 
would have routinely and readily appreciated that the SEQ ID NO:5 polypeptide would be a 
useful tool in conducting protein expression analyses, using the 2-D PAGE mapping or western 
analysis techniques, in furtherance of (a) the development of drugs for the treatment of cancer, 
immune disorders, and infectious diseases, and (b) analyses of the efficacy and toxicity of such 
drugs. 
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14. I declare further that all statements made herein of my own knowledge are 
true and that all statements made herein on information and belief are believed to be true; and 
further, that these statements were made with the knowledge that willful false statements and the 
like so made are punishable by fine or imprisonment, or both, and that willful false statements 
may jeopardize the validity of this application and any patent issuing thereon. 



L. Michael Fumess, B.Sc. 

Signed at Exning, United Kingdom 

this day of , 2003 
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detecting a polynucleotide encoding said polypeptide; said 
method wherein the polynucleotide is amplified by applying 
PCR; an isolated and purified polynuceotide comprising 
polynucleotide sequence SEQ ID NO. 10 and fragments thereof, 
or a variant having at least 90% identity; a polynucleotide 
having a sequence which is complementary to said 
polynucleotide; an expression vector comprising at least a 
fragment of said polynucleotide; a host cell comprising said 
expression vector; a method for producing a polypeptide 
comprising amino acid sequence SEQ ID NO. 1; a 
pharmaceutical composition comprising said polypeptide in 
conjunction with a suitable pharmaceutical carrier; an 
antibody which specifically binds to said polypeptide. 
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8. Claims: 1-16, 19 (all partially) 
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A two-dimensional gel database of rat liver proteins 
useful in gene regulation and drug effects studies 

A standard two-dimensional (2-D) protein map of Fischer 344 rat liver 
(F344MST3) is presented, with a ubular listing of more than 1200 protem species. 
Sodium dodccyl sulfate (SDS) molecular mass and isoelectric point have been es- 
ublished, based on positions of numerous internal standards. This inap has been 
used to connect and compare hundreds of 2-D gels of rat I'v" samples frorn a va- 
riety of studies, and forms the nucleus of an expandmg database descnbing rat 
liver proteins and their regulation by various drugs and loxic agents. An example 
of such a study, involving regulation of cholesterol synthesis by cholesterol-lower- 
ing drugs and a high-cholesterol diet, is presented. Smce the map has been ob- 
uined with a widely used and highly reproducible 2-D gel system (the lso-Dalt» 
system), it can be directly related to an expanding body of work m other laborato- 
ries. 
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1 Introduction 

High-resolution two-dimensional electrophoresis of pro- 
teins, introduced in 1975 by O'FarreU and others ll^].has 
been used over the ensuing 1« years to exammc a wide va- 
riety of biological systems, the results appearing in more 
than 5000 published papers. With the advent of computer- 
ized systems for analyzing two-dimensional (2-D) gel ima- 
ges and constructing spot databases, it is also possible to 
plan and assemble integrated bodies of information de- 
scribing the appearance and regulation of thousands of pro- 
tein gene products (5, 61. Creating such databases involves 
amassing and organizing quantitative data from thousands 
of 2-D gels, and requires a substantial commitment in tech- 
nology and resources. 

Given the long-term effort required to develop a protein da- 
tabase, the choice of a biological system takes on consider- 
able importance. While in viiro systems are ideal foranswer- 
in^ manv experiminial queitioni. especially in cancer re- 
search and genetics, our experienc; wiih cell cultures atid 
tissue samples suggests that some in vivo approaches could 
have major advantages. In panicular. we have noticed that 
liver tissue samples from rats and mice appear to show grea- 
ter quantitative reproducibility (in terms of individual pro- 
tein expression) than replicate cell cultures.This is perhaps 
a natural result of the homeostasis maintained m a com- 
plete animal vs. the well-laiown vanability of cell cultures, 
the latter due principally to differences in reagents (e.g.. 
fetal bovine serum). conditions ie.::.. pH ) and genetic "evo- 
lution^ of cell lines while in culture. It is also more difTiculi 
to generate adequate amounts of protein from cell culture 
systems (particularly with attached ceils), forcing the inves- 
tigator to resort to radioisoiope-based or silver-based stain- 
detection methods. While these methods are more sensi- 
tive (sometimes much more sensitive) than the Coomassie 
Brilliant Blue (CBB) stain to pically used for protem detec- 
tion in -large" proiein samples, they are generally more van- 
able, more labor-intensive and. in the case of radiographic 
methods, may generate highly -notsy" images, due to the 
properties of the films used. By contrast, large protein sam- 
ples can easily be prepared from liver using urea/Nonidet 
P-JO (NP-40) solubilization and stained with CBB. which 
has the advantage of being easily reproducible [8]. Finally, 
there remains the question of the nr\ithfulness"of many in 
vitro systems as compared to their in vivo analogs; how 
great are the changes caused by the introduction into a cul- 

0irM)»35/9l/im-«907 $3.5»»Ji/0 



ture and the tssocxaied shift to strong selection for growth 
arid how do these a/Tea erpcrimeniaJ omcomes? He^ce 
Slrimfr. *>f '« sy«ems. in terms of ex- 

penmental manipulauon. may be counterbalanced bv 
other factors relaung to 2-D data quality. 

There is a second imponani class of reasons for exploring 

toncally. there have been tmo broad approaches to the me- 
S of biochemical processes in intact Sl- 

Inf.? for informative mutants) 

Both approaches help us to understand complex svstems 
bj disruptmg some specific functional element and show- 
mg us the result. VMth the development of techniques J^r 
«n K cloning, the genetic approach 

can be effectively applied either in vitro or in vivo, although 

can also be applied to eithersort of biological svstem: here 
?n "y"' "nsi^^ntly acquired information is' 

m expenmeniaJ animals (rats and mice). While most biolo- 
mtn,^i°** r''?"i" °^ compounds having specific, experi- 
^^oS °fP™teir» synthesis, 

lonophores. polymerase inhibitors, channel blockers, nu- 
of ^ofi'^ T-"** compounds affecting polymerization 
of cyroskeleul proteins), there is a much larger number of 
m cresting chem.cally-induced elTects. most of them char- 
acterized by toxicologisis and pharmacologists in rodent 
s> stems. Just as a thorough genetic analysis would involve 
saturaung a genome with mutations, it is possible to ima- 
gine a saturating number of drugs, the analvsis of whose ac- 
tions would reveal the complete biochemistry of the cell 
While organized drug discovery efforts usually target spe- 
cific desired effects, the nature of the process, with its de- 
pendence on screening large numbers of compounds ne- 
cessanly produces many unanticipated effects. It is there- 
lore reasonable losuppos; that the required broad ranee of 
compoijnds necessar?- to achieve -biochemical saturation- 
may be forthcoming: in fact.it may already exist among the 
hundreds of thousands of compounds that failed to qualify 
as drugs. 



has been made in the development of mous- r., 
man hepatocyie culture sysiems.as well as ^tzirlt '"'^ 
tissue slices. Using such an arrav of technijues 
ble to assemble a matrix of mammalian svs'tem; inS '"^^ 
mouse and rat in vivo on one level and mouse ra, f '°'^-' 
man rn vino on a second level, and to compar; efff^ 
tween species and between svstems This annrL k ^ 
us to draw infomied conclusions r.L^Z Se bioS "''"^^ 

universality- ofbiologicalrespons:samonetheS^^^ 
and to offer some insight into the validlK of ^f^^'^'-- 
proaches for toxicological screenine. We bel° ,1' " 
will be necessary if ,« vi.ro alternatives are to acJii * 
usage in government-mandated safetv iestm° of 5 ' ' " 
sumer products and industrial and agriculm °f cSiSV 

A numberof interesting studies have been published , 
2-D mapping to examine effects in the rodent I S i ""^-^ 
ber of mvestigarors have made use of the tech^i^ " 
screen for existing genetic variantas-nir, '""""'i'; :.' 
lions {12-14]. mai4 in the mouse Thi wort' buiM ' 
wealth of genetic informaiiraS 1^ on ihJ ' ' °" 
its established position as a m\mm'J aSm'u uS-deS' 
tion system. While some studies of chemiSl eS A 

rat [18-23]. The examination of the cnochrome 0-450 « 



Among organs, the liver is an obvious choice for the study 
of chemical effects because of its well-known plasticity and 
responsiveness, -njc brain appears to be quite plastic (e.g 
u-r.r . J-^ " * complicated mixture of cell types requiring 
skillful dissection for most experiments. The kidnev, while 
quite responsive, also presents a potentially confounding 
mixture of cell types. The liver, by contrast, is made up of 
one predominant cell type which is easy to solubilize: the 
hepatocyte, representing more than 95% of its mass. Most 
importontly, the liver performs many homeostatic func- 
tions that require rapid modulatioii of gene expression It 
appears that most chemical agents tested affect gene ex- 

nlfnllfcil"^''!''''" Leigh Anderson, 

unpublished observations), an interesting contrast to our 
earlier work with l.vmphocytes. for example, which seem to 
./rjTn"^, "fPonsive.Such results conform to the expec- 
,^o?.?H^ * homeostatic. physiological role 

Should be more plastic than cells differentiated for a pur- 

Sgene? °" 

The liver also allows the parallels between in vitro and in 
VIVO systems to be examined in detail. Significant progress 



^°"^''l=^a"ons lead us to conclude that rodent 
offers the best opportunity to systcmaticailv examine d; 
array of gene regulation systems, and ultimateK to ih^" 
predictive model of large-scale mammaliTn g ne conw 
The basic underlying foundation of such a project is a rS."' 
ab e, reproducible master 2-0 pattern of live . o wikh o 

fenon'^h'™"'"' '""'^^ referred. In this paper.u" 

repon such a master pattern for the acidic and neutral pro- 

'^'^i'"''"'''^ t.ymapsofbas.cproieins.and analog- 
ous maps of mouse and human liver. 

2 Materials and methods 
2.1 Sample preparation 

Liver is an ideal sample material for most biochemical stud- 
ies, including 2-D analysis. A sample is taken of approxima- 
tely 0.5 g of tissue from the apical end of the left lobe of thf 
liver. Solubilization is effected as rapidly as practical; a 
oelay of 5- 15 mm appears to cause no major alteration in 
iver protein composition if the liver pieces are kept cold 
(e.?., on ice) in the interim. In the solubilization process, 
the liver sample is weighed, placed in a glass homogenizer 
{e.g., 15 mL Wheaton); 8 volumes of solubilizing solution* 

• The solubilizing joluiion is composed of 2% NP-40 (Siitni).* »* «'«' 
(analytical grade, e.g.. BDH or Bio-Rad). 0.5% diihiothre.iol (PTT: 
iigma ) and 2 % carrier ampholytes (p H 9- 1 1 LKB ■ these come as J -O"- 
siocic loluiion.so 2 % final concentration is achieved by making Ihtfi"*" 
solution 10% 9-1 lAmpholine by volume). A Urge batch of solubiluf 
(several hundred mL) is made and stored frozen at -80 °C in iliQ"** 
sumcieni to provide enough for one day s estimated sample pr«l-»" 
lion requiremeni. The solution is never allowed to become •'»n»« 
than room temperature at iny stage during preparation or Ihiwiol '* 
use. smce heating of concentrated urea solutions can produce coniaW 
nants that covalently modify proteins producing artifaetual *1P 
shifts. Once thawed, any unused solubilizer is discarded. 
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I (i.e., 4 mL per OS g tissue) and the mixture is ho- 
gpized using first the loose- and then then the tight-fit- 
y^act pestle. This takes approximately 5 strokes with 
^pestle and is earned out at room temperature because 
would crysialliie out in the cold. Once the liversample 
fljoroughly homogenized in the solubilizer. ii is assumed 
it til the proteins are denatured (by the chaoiropic cfTect 
the urea and NP-40 detergent) and the enzymes inacii- 
ttd by the high pH (-9 J). Therefore these samples may 
jtept ai room temperature until they can be centrifiiged 
frozen as a group (within several hours of preparation). 
,B samples arc cemrifuged for6 X 10* ^ min (rg.. 500 000 
^for 12 min using a Beckman TL-100 centrifuge). The 
aitrifuge rotor is maintained ai just below room lempera- 
je {e.g.. 15-20'C), but not too cold, so as to prevent the 
■edpiuiion of urea.The centrifuge of choice is a Beckman 
HOD because of the sample tube sizes available, but any 
Itracentrifuge accepting smallish tubes will suffice. "VVTicn 
1 appropriate centrifuge is not available near the site of 
unple preparation, samples can be frozen at — 80'C and 
awed prior to centrifugation and collection of supema- 
ints.Each supernatant is carefully removed following cen- 
■ifiigaiion and aliquoted into at least 4 clean tubes forstor- 
ge.Tbis is done by transferring all the supernatant to one 
lean tube, mixing this gently (to assure homogeneous 
cnnposition) and then dividing it into 4 aliquois. The ali- 
uots are frozen immediately at -80°C. These muhiple ali- 
uots can provide insurance against a failed run or a freezer 
reakdown. 

L2.Two-dimensional electrophoresis 

iL . ■ 

Jample proteins are resolved by 2-D electrophoresis using 

20 X 25 cm Iso-Dalt» 2-D gel system ([26-29]; pro- 
liSced by LSB and by Hoefer Scientific Instrumenu, San 
rrancisco) operating with 20 gels per batch. All first-dimen- 
iional isoelectric focusing (lEF) gels are prepared using the 
same single standardized batch of carrier ampholytes 
4-8A in the present case, selected by LSB's batch- 
nesting program for rat and mouse database work"*). A 10 
^Tsample of solubilized liver protein is applied to each gel, 
^ the gels are run for 33 000 to 34500 volt-hours using a 
progressively increasing voltage protocol implemented by 
programmable high-voltage power supply. An Ange- 
iique"' computer-controlled gradient-casting system (pro- 
duced by LSB) is used to prepare second-dimensional sod- 
iwn dodecyl sulfate (SDS) polyacrylamide gradient slab 
geb in which the top 5 % of the gel is 1 1 %T acrylamide, and 
tSTlower 95 % of the gel varies linearly from 1 1 % to 1 8 %T. 



system has recently been modified so as to employ a 
gnunercially available 30.8%T acrylamide/A',A''-methyle- 
oebisacrylamide prepared solution (thus avoiding the han- 
^g of the solid acrylamide monomer) and three addi- 
tional stock solutions: buffer (made from Sigma pre-set 
Tris), persulfate and A'.MA'.A'-tetramethylethylenedi- 
•igine (TEMED). Each gel is identified by a compuier- 
" pted filler paper label polymerized into the lower left cor- 
r^of the gel. First-dimensional lEF tube gels are loaded 

smitertal (fueceedinieenified batches of which ire available from 
"oefer Scientine Instrumeats) has the most linear pH tradient pro- 
uced by any ampholyte tested except for the Pharmacia wide range 
which has an unaccepuble tendency to bind high-molecular weight 
die proteins, causing them to streak). 



directly (as extruded) onto the slab gels without equilibra- 
tion, and held in place by polyester fabric wedges (Wed- 
gies", produced by LSB) to avoid the use of hot agarose. 
Second-dimensional slab gels are run overnight, in groups 
of 20, in cooled DALT tanks (lO'C) with buffer circulation. 
All run parameters, reagent source and lot information, 
and notations of deviation from expected results are ente- 
red by ihe technician responsible on a detailed, multi-page 
record of the experiment. 

12 Suining 

Following SDS-elecirophoresis, slab gels are suined for 
protein using a colloidal Coomassie Blue G-250 procedure 
in covered plastic boxes, with 10 gels (totalling approxima- 
tely 1 L of gel) per box. This procedure (based on the work 
of NeuhofT[30,31)) involves fixation in 1.5L of 50% etha- 
nol and 2% phosphoric acid for2h. three 30 min washes, 
each in 2 L of cold tap water, and transfer to 1.5 L of 34%i 
methanol, 17% ammonium sulfate and 2 % phosphoric acid 
for 1 h. followed by the addition of a gram of powdered Coo- 
massie Blue G-250 stain. Staining requires approximately 4 
days to reach equilibrium intensity, whereupon gels are 
transferred to cool tap water and their surfaces rinsed to re- 
move any paniculate stain prior to scanning. Gels may be 
kept for several months in water wjih added sodium azidc. 
The water washes remove ethanol that would dissolve the 
stain (and render the system noncoUoidal. with high back- 
grounds). The concentrated ammonium sulfate and meth- 
anol solution is diluted by equilibration with the water vol- 
ume of the gels to automatically achieve the correct final 
concentrations for colloidal staining. Practical advantages 
of this staining approach can be summarized as follows: (i) 
the low, flat background makes computer evaluation of 
small spots (max OD < 0.02) possible, especially when 
using laser densitometo", (») up to 1500 spots can be reli- 
ably detected on many gels (e.g., rat liver) at loadings low 
enough to preserve excellent resolution; and (iii) reprodu- 
cibility appears to be ver\' good: at least several hundred 
spots have coefficients of reproducibility less than 15%. 
This value is at least as good as previous CBB methods, and 
significantly better than many silver stain systems. 

2.4 Positional standardization 

The carbamvlaied rabbit muscle creatine phosphokinase 
(CPK) standards [32] arc purchased from Pharmacia and 
BDH. Amino acid compositions, and numbers of residues 
present in proteins used for internal standardization, are 
ukcn from the Protein Identification Resource (PIR) se- 
quence database (33). 



2.5 Computer analysis 

Stained slab gels are digitized in red light at 134 micron re- 
solution, using either a Molecular Dynamics laser scanner 
(with pixel sampling) or an Eikonix 78/99 CCD scanner. 
Raw digitized gel images are archived on high-density DAT 
tape (or equivalent storage media) and a greyscale video- 
print prepared from the raw digital image as hard-copy 
backup of the gel image. Gels are processed using the Kep- 
ler* software system (produced by LSB), a commercially 
available workstation-based software package built on 
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some of the principles of the earlier TYCHO system 134- 
41). Procedure PROCOOS is used to yield a spoUist giving 
position, shape and densit>- information for each detected 
spot. This procedure makes use of digiul filtering, mathe- 
matical morpholog>- techniques and digiul masking to re- 
move the background, and uses full 2-D least-squares opti- 
mization lo refine the parameters of a 2-D Gaussian shape 
for each spot. Processing parameters and file locations are 
stored m a relational daubase. while various log files detail- 
ing operauon of the automatic analysis software are ar- 
chived with the reduced data. The computed resolution and 
level of Gaussian convergence of each gel arc inspected 
and archived for qualit>- control purposes. 

Experiment packages are constructed using the Kepler ex- 
periment definition database to assemble groups of 2-D 
patterns corresponding to the experimental groups (eg 
treated and control animals). Each 2-D pattern is matched 
to the appropriate "master" 2-D pattern (pattern 
F344MST3 in the case of Fischer 344 rat liver), thereby 
providing linkage to the exisung rodent protein 2-D data- 
bases. The sofr*-are allows experiments containing hun- 
dreds of gels to be constructed and analyzed as a unit, with 
up to 100 gels displayed on the screen at one time for com- 
parative purposes and multiple pages to accommodate ex- 
periments of > 1000 gels. For each treatment, proteins 
showing significani quantitative differences vs. appropriate 
controls are selected using group-wise statistical parame- 
ters (e.g.. Student s t-iest, Kepler* procedure STUDENT). 
Proteins satisfying various quantitative criteria (such as P< 
0.001 difference from appropriate controls) are repre- 
sented as highlighted spots onscreen or on computer-plot- 
ted protein maps and stored as spot populations (/.e., logi- 
cal vectors) in a Lver protein daubase. Quantitative' dau 
(spot parameters, statistical or other computed values) are 
stored as real-valued vectors in the database. Analvsis of co- 
regulation is performed using a Pierson product-moment 
correlation (Kepler procedure CORREL) to determine 
whether groups of proteins are coordinately regulated by 
any of the treatments. Such groups can be presented graphi- 
cally on a protein map,and reported together with the statis- 
ucal criteria used to assess the level of coregulation.Multi- 
vanate statistical analysis (e.g., principal components' ana- 
lysis) IS perfonnaed on dau exported to SAS (SAS Institute). 



ceuticals. ground and mixed with the diet ai 
of 0.075 % and 1 %. respectively. Tlie ScS"'^^^'°'u 
was Punna 580IM-A (5% cholesterol plus 1 % 1'e= 
late in the control diet). Animal work was carrion 
crobiological Associates (Bethesda, MD) A^mtu"' 
climaiized forone week on the control diet 
trol diets for one week, and sacrificed on dav s l*"^""- 
daily doses of lovastatin and cholestyramine in annr 
groups were 37 mg/kg/day and 5 'g/ks/dax- 
based on the weight of the food consumed Live?."'" "'- 
were collected and prepared for 2-D electrophoresh ^"'^ 
ing to the standard liver protocol (homoe-ni,., 
volumes of 9 m urea.2% NP^q 0 5 "S-" f « 
LKB pH 9-1 1 carrier ampholytes, follow ^r^S 
tion for 30 mm at 80000 X g). Kidnej; brain and ' 
samples were frozen. Gels were run as d«cr,b,H Jk''"' 
and the data was analyzed using the Kepi"* sv«A,'r 
were scaled, to remove the effect of differences i^n T; " 
loading, by setting the summed abundai^Sa larJ^ 
ber of matched spots equal for each ger("neaVsc^,1nT- 

3 Results and discussion 
3.1 The rat liver protein 2-D map 



2.6 Graphical data outpnt 

Graphical results are prepared in GKS and translated 
withm Kepler* into output for any of a variety of devices 
Linedrawmg output is typically prepared as Postscript and 
printed on an Apple LaserWriter. Detailed maps presented 
here have been generated using an ultra-high-resoiution 
Postscript-compatible Linotronic output device. Greyscalc 
graphics are reproduced from the workstation screen using 
a Seikosha videoprinter. Patterns are shown in the sundard 
orientation, with high molecular mass at the top and acidic 
proteins to the left. 

2.7 Experimeat LSBC04 

In the study described here 12-week-old Charies River 
male F344 rats were used. Diets were prepared at LSB 
based on a Purina 5755M Basal Purified Diet. Lovastatin' 
and cholestyramine were obuined as prescription pharma- 



F344MST3 is a standard 2-D pattern of rat liver proteins 
based on the Fischer 344 strain. This pauern was initi! 
from a single 2-D gel and extensively edited in an «plr° 
mem conipanng it to a range of protein loads, so as to .n- 
elude both small spots and well-resolved representations of 
high-abundance spots. More than 700 rat liver 2-D oattems 
have been matched to F344MST3 in a series of drug effects 
and protein characterization experiments, and numerous 
new spots (induced by specific drugs, for instance) hav: 
been added as a result. A modified version including addi- 
t.ona! spots present in the Sprague-Dawlev outbred rat has 
also been developed (data not shown). Figure 1 shows a 
greyscale representation and Fig. 2 a schematic plot of the 
master pattern. More than 1200 spots are included, most of 
which are visible on typical gels loaded with 1 0 uL of solubi- 
hzed liver protein prepared by the standard method and 
stained with colloidal Coomassie Blue. Master spot num- 
bers (MSN's) have been assigned to all proteins, and ap- 
pear m the following figures, each showing one quadrant of 
the pattern. Figure 3 shows the upper left (acidic, high 
molecular mass) quadrant. Fig. 4 the upper right (basic, 
high molecular mass) quadrant. Fig. 5 the lower left (acidic, 
low molecular mass) quadrant, and Fig. 6 the lower right 
(basic, low molecular mass) quadrant. The quadrants over- 
lap as an aid to moving between them. The gel position (in 
100 micron units), isoelectric point (relative to the CPK in- 
ternal p/ standards) and SDS molecular mass (from the cali- 
bration curve in Fig. 8) are listed for each spot (Table 1). Be- 
cause of the precision of the CPK-p/ values, these parame- 
ters can be used to relate spot locations between gel s.vs- 
terns more reliably than using p/ measurements expressed 
as pH.A major objective of current studies is the identifica- 
tion of all major spots corresponding to known liver pro- 
teins, as well as rigorous definitions of subcellular orga- 
nelle contents. Of particular interest to us is the parallel de- 
velopment of identifications in the rat and mouse live' 
maps, allowing detailed comparisons of gene expression ef- 
fects in the two systems. The results of these studies will ^ 
presented systematically in a later edition of this daubase 
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Kn include here a useful series of 22 orienting ideniifi- 
gbosasanaid looiherusereofiheratlivcrpaiieni (Table 

m 

^[ 

l*C«b«njylaied cfaaree sundards. eompuled pfs mod 
■ molecular mass sundardizatioD 

febave previously shou-n thai the use of a system of dose- 
^aced internal pi markers (made by carbamylaiing a 
,ac protein) offers an accurate and workable solution to 
le problem of assigning positions in the p/ dimension [32]. 
he same system, based on 36 protein species made by car- 
ainylating rabbit muscle CPK. has been used here to as- 
ign p/'s to most rat liver acidic and neutral proteins. The 
undards were coelectrophoresed *-ith total liver protems, 
ud the standard spots added to a special version of the 
laster pattern F344MST3. The gel J-coordinaies of all 
ver protein spots lying within the CPK charge train were 
ben transformed into CPK pi positions by interpolation 
)etween the positions of immediately adjacent standards 
Table 1) using a Kepler* vector procedure. 

t has proven possible to compute fairly accurate pi values 
or many proteins from the amino acid composition (42). 
Ve have attempted here to test a funber elaboration of this 
ipproach. in which we computed p/" s fonhe CPKsiandards 
Jiemselves, based on our knowledge of the rabbit muscle 
CPK sequence and the fact thai adjacent members of the 
large train typically differ by blockage of one additional ly- 
ane residue (Table 3). We compared these values to similar 
computed pfs for an additional set of carbamylated siand- 
ffiis made from human hemoglobin beta chains and a se- 
nes of rat liver and human plasma proteins of known posi- 
fion and sequence (Fig. 7.Table 4). The result demonstrates 
good concordance between these systems. Two protems 
show significant deviations: liver fatty-acid binding protein 
(FABP; #1 in Table 4) and protein disulphide isomerase 
(CO in' the table). The FABP spot present on F344MST3 
may represent a charge-modified version of a more basic 
^nt spot closer to the expected pi, not resolved in the 
EF/SDS gel. Of panicular imporunce is the fact that, by 
Smparing computed pfs of sequenced but unlocated pro- 
Sns with the CPK pfs, we can assign a probable gel loca- 
Sbn without making any assumptions regarding the actual 
gel pH gradient. This offers a useful shortcut, given the va- 
garies of pH measurement on small diameter lEF gels. We 
ilve used this approach to compute the CPK pfs of all rat 
Sd mouse proteins in the PIR sequence database, as an aid 
protein identification (dau not shown). 

i.order to standardize SDS molecular weight (SDS-MW), 
j^have used a standard curve fitted to a series of identified 
Jjpteins (Fig. 8). Rather than using molecular mass perse, 
*e have elected to use the number of amino acids in the 
Wlypeptide chain, as perhaps a better indication of the 
' gth of the SDS-coaied rod that is sieved by the second 
'"ension slab. The resulting values were multiplied by 
(the weighted average mass of amino acids in se- 
enced proteins) to give predicted molecular masses. Be- 
"se we use gradient slabs, we have not constrained the fit- 
curve to conform to any predetermined model; rather 
tried many equations and selected the best using the 
gram "Tablecurve" on a PC. The equation chosen was y 
+ tjr+ c/x', where y is the numberof residues, jc is the gel 



r coordinate, a is 5 1 1 .83. is -0.2731 and r is 33 1 83 80 1 . The 
resulting fit appears to be fairly good over a broad range of 
molecular mass. 

3 J An example of rat liver gene regulation: Cholesterol 
metabolism 

Experiment LSBC04 was designed as a small-scale test of 
the regulation of cholesterol metabolism in vivo by ihree 
agents included in the diet: lovastatin (Mevacor«,an inhibi- 
tor of HMG-CoA reductase); cholestyramine (a bile acid 
sequesirant thai has the effect of removing cholesterol 
from the gut-liver recirculation); and cholesterol iiself.The 
first two agents should lower available cholesterol and the 
third should raise it, allowing manipulation of relevant 
gene expression control systems in both directions. Such 
an experiment offers an interesting test of the 2-D mapping 
svstem since most of the pathway enzymes are present m 
low abundance, many are membrane-bound and difTicult 
10 solubilize, and the pathway itself is complex. Approxima- 
tely 1000 proteins were separated and detected m liver ho- 
mogenates. Twenty-one proteins were found to be affected 
by at least one treatment, and these could be dwided into 
several coregulated groups. 

23.1 MSN 413 (puutive cjiosolic HMG-CoA synthase) 
and sets of spots regulated coordinately or inversely 

One group of spots (including a spot assigned to the cyio- 
solic HMG-CoAsynthase,MSN 413) showed the expected 
increase in abundance with lovastatin or cholestyramine, 
the synergistic further increase with lovastatin and choles- 
tyramine, and a dramatic decrease with the high cholesterol 
diet. Spot number 413 is the most strongly regulated pro- 
tein in the present experiment, showing a 5- to 10-fold in- 
duction after a 1 week treatment with 0.075% lovastatin and 
1% cholestyramine in the diet (Figs. 9 and 10). Us expres- 
sion follows precisely the expectation for an enzyme whose 
abundance is controlled by the cholesterol level; it is pro- 
gressively increased from the control levels by cholestyra- 
mine, lovastatin and lovastatin plus cholestyramine and it 
sinks below the threshold of detection in animals fed the 
high cholesterol diet. This spot has been tentatively identi- 
fied as the cytosolic HMG-CoA synthase, based on a reac- 
tion with an antiserum to that protein provided by Dr. Mi- 
chael Greenspan at Merck Sharp&Dohme Research Labo- 
ratories. This enzyme lies immediately before HMG-CoA 
reducuse in the liver cholesterol biosynthesis pathway, and 
is known to be co-regulated with it. Spot 413 has an SDS 
molecular weight of about 54 000 and a CPK pi of -1 M. in 
reasonably close agreement with a molecular weight of 
57300 and a CPK pi of -15.7 computed from the known se- 
quence of the hamster enzyme 143). 

Using a classical product-moment correlation test (Kepler 
procedure CORREL), a series of five additional spots was 
found to be coregulated with 413. The level of correlation 
was exceedingly high O 95%).Two of these. 1250 and 933. 
are at similar molecular weights and approximately one 
charge more acidic than 413 (Fig. 9). indicating that they 
may be covalently modified forms of the 413 polypeptide. 
This suspicion is strengthened by the observation that both 
spots are also stained by the antibody to cytosolic HMG- 
CoAsynthase.The remaining three correlated spots appear 
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was selected based on a regulaton paiicm close to th* in 
verse of that for spot 413 (MSN's 34 79, m m 2oJ Si?" 

spou probably represent addition^ enzyme? or lUbuSts' 
3J.2 MSN 235 and coregulated spots 

6^atlrSf^-^'7 'J'°''' ""^"^^ "°P"«'l of mitochon- 
drial proteins including putative mitochondrial HMG- 

CoA synthase spois.showed a modest induction by lovasS 
tm alone, but little or no effect with anv of the othe? 

rhXcl "P"''*^ '° only the regulation of enzvmes of 

packed riad at approximately 30 kDa, and are likelv to 
present isofonns of one protein. All thr«spo" are stained 
by an antibody to the mitochondrial fori^THMS-CoA 
synthase obtained from Dr. Greenspan. Subcellular fraci°i 
nation indicates a mitochondrial location iSe ShJffw^ 
spots (633 at about 38 kDa and 724^ abou?69 mY) ar^ 
each present at lower abundance than the memberlof the 
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proteins of the puuiive mitochondrial pathu 

inuch more variable in iheircxpression in a?S„ '- ^ 
amination of all the coregulated groups sug'^^^^^^^^^^^^ 
titative statistical techniques can extract a St r*'"'"- 
esiing information from large sets of rcprodS °^ 
abundance of spots in the 413 coregulat^Sou '^'^^ 
ple.shows an amazing level of concordat 
expression among the five individuals of the Jova 
cholestyramine treatment group This efTeci ic ^"^ 
differences in total protein foad.Sc.^h". ha?.' '° 
been removed by scaling, and since proteins Ihh ' 
ferent regulation patterns can be demonstrated ''if" 
13).Such effects raisethe possibility thTma"v«Ve? - ''^ 
lation sets may be revealed through the studv "r 
ciently large population of controPanima Je ,',!"""'- 
any expenmenial manipulation). This aoDroarh 
natural biological variation in proteh^ expreS: 
drug effects, offers an impoaam ince"n%Tr The ^onsf °' 
tion of a large hbran- of control animal patterns 

4 Conclusions 

^'emi^rv"^'*!,' ''■''"P'"'' °f both basic bio- 
chemisto- and in toxicology, there is a long-term need t 
comprehensive database of liver proteins T^/r^,"elL 
tcr pattern presenied here has proven to be an ac-ur.r 

than 700 gels to date. As the number of proteins idenXd 
and the number of compounds tested for gene eiprJ ln 

iTJLlu ' 7 "''"^ '"^^ ''^^^^^ to comrSuMX 
eri Lr af oV^^ If °' P"«i=^ "til.ty .n sev 

monstrated ^'^^'y '"''"^ 
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3.3 J An example of u aati-synergisUc effect 

^jr.«,'h°' PfJ? induction by lovastatin 

two- to threefold), and about half as much induction wth 

rnai-animal heterogeneity pattern of the 235-set (Fig 13) 
S cXS of^in represents the ci 2: 

SolesmamiSl of lovastatin and 

cnoiesiyramine. The existence of such an effect riPmo„ 

siveiy through the same regulatory pathway. 

3J.4 Complexity of the cholesterol synthesis pathway 

vLu'Jiirf inl!!';'*'"^'""''' ^"^tnent with 10- 

^Says us°L HMG r Tr'" ^"'^ """ochondriS 
mh-Zh ^ t ■ Cholestyramine on the 

o her hand, either alone or in combination with lovSin 

EufntuVoVio?^'";"^ "V"' '^'''^''^^ cytosoHc paThway! 
way An e«r»n5 V°° mitochondrial path- 

onlTe°^SJr°"' cholestyramine should if?ec1 
oUn^hi^ Si''' ^1'^*^''***'^««'>'"°^™"»d by cholester- 
ol and bile acid levels. It remains to be explained why some 
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^ F.gurr 7. (a) Plot of computed isoelecir.c pomi versus gel >-posi„on (o- 
two sets of carbamylaied standard proteins (rabb.l muscle CPK 1*1 in- 
human hemoglobin S chain, filled diamonds) and several other proicnl 
(shaded squares), (b) The identities of the vanous protems represeniti 
by the squares are indicated by the numbers in correspondmg posuion, 
on (a); these refer to Table 4. 
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uff 9. Montage showing effects in lb* 
region of MSN :4 1 3. The montage sho" • 
small window into one portion of the J-D 
pattern, one row of windows for each exjv 
rimental group, and one panel forei*!*' 
in the experiment. Tlie lefi-most paiiew 
in each row is a group-specific cop.' of 
master pattern followed by the pinen" 
for the five individual rats in the frooP- 
The highlighted protein spots (filled cirf 
les) are spot 4 1 3 (on the nghi of each pi^ 
el; identified as cytosolic HMG-CoAfl* 
lhase) and two modified forms of ii ('^ 
and 933). From the top, the rows (exj*"*- 
mental groups) are: high cholesterol. 
trols, cholestyramine, lovasiatin.ani 
sutin plus eholettyramine. 
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Fifure 10. Barguph showing ihe quaniiia- 
live efTecis of virious ireaiments on the 
abundance orMSN:413 (cyiosolic HMG- 
CoA synthase) in the gels of Fig. 9. 




Figure 11. Bargraphs of a series of six core- 
gulated spois including MSN:413. In the 
bargraphs. the abundances of the appro- 
priate spot (misierspoi number shown ai 
the lop of the panel) in each animal are 
shown. The five rive-animal groups are in 
the order (left to right): high cholesterol, 
controls, cholestyramine, lovastatin, and 
lovuutin plus cholestyramine. Each bar 
within a group represent* one experimen- 
tal animal liver (one 2-D gel). Note the cor- 
related expression of the 6 spots, espe- 
cially in the two far right (most strongly in- 
duced) groups. 
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Figure i:. Daia on a second coreguljir; 
frourofspois.prfseni;!! as in Fig. II T:. 
founh cxperimenial group dovasia;:: 
shows a modes; induction, while the fifir 
group (lovasiaiir plus cholesiyramint ■ 
does not. 



367 





Figure 13. Data on spot MSN:367, presented as in Fig. 1 1. This 
shows unanubiguously the inti-synergiitic efTecl of lovastatin an'' 
tyramine (fifth group) as compared to lovasutin (fourth group) 
ponse contrasts strongly with the regulation pattern seen in Fig '•• 
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1 IDS 


n A 


•M Tl W> 

Z3./UU 


197 


667 




-CU. 1 


C9 eftn 


196 




Ml 


>v.U 


AJ c/v\ 
*A,^O0 


199 


1711 


674 


_9 9 


AA Dnn 
•4 .HUD 


200 


872 


424 




OD.UUU 


201 


292 


435 






202 


736 


£9.9 


.1 A 


^ A7 onn 


203 


786 


829 


-1 K 7 
•ID./ 


97 Afr\ 
•3 / 


204 


1224 


589 




K/\ onrt 


205 


439 


963 




J 1 . tlM 


206 


1994 


571 


>u.u 


oi . JOO 


207 


1895 




-O.J 


44.200 


Aw 






<-J!>.U 


15.800 


210 


1 700 




9 9 

-Z.J 


57.000 


211 


902 


51 7 


-1 1 


55,400 


213 


1067 


Aftl 


-1 u.* 


44,400 


214 


1340 


ooo 


9 n 


45.200 


215 


1581 




-J .9 


57.300 


216 


1585 


755 


-J.C 


40.700 


217 


1159 


393 


-9, J 


CO 9ftn 
GP9.JUU 


218 


831 


572 


-13 5 


fti 9fm 


219 


713 


177 


-1 8.7 


170 500 


220 


1479 


91 1 


-4.9 


33 900 


221 


965 


927 


-1 2 B 


33 300 




934 


716 


-13.5 




225 


1812 


1045 


-1 0 


9n MY) 


226 


821 


41 1 


-1 SB 


66 800 


227 


1566 


14&3 


-3.6 


13 600 


22B 


1065 


567 


•10 8 


fti IWi 

9 1 


229 


1577 


890 


•o. / 


•U ttVI 


230 


1456 


496 


-ft 9 


K7 9IV^ 

d/,JW 


232 


1440 


849 


-ft ft 


•uc c/v^ 


234 


1682 




9 A 


C9 

3/.9UC 




618 




99 n 
-£a.v 


9/1 9/V1 


236 


920 


1 138 


.n 7 

- 1 o . / 


9ft Af^ 


237 


952 


1 006 


-1 T 1 

- 1 0. 1 






161 1 


1 


9 9 


K9 C/V1 






TOO 




.J9 C/V\ 






AAA 


97 7 


4-9 1 no 


241 


1620 


569 


"V.V 


fti AT^ 


242 


1357 


656 


•6 8 


Aft B/V1 
43.C1UU 


243 


71 1 


1 182 


-16.7 




244 


1855 


621 


^ 6 




245 


1169 


474 




ftO 9/Y^ 


246 


551 


459 


-25 1 


D 1 ,IAA/ 


247 


1348 


604 


,8 9 


49, 1IA/ 


248 


460 


448 


-29.3 


62.100 


249 


1733 


451 


•1.9 


61,800 


250 


1974 


788 


>0.0 


39.200 


251 


806 


392 


-16.1 


69,500 


252 


874 


553 


-14.6 


52.500 


253 


753 


848 


•17.6 


36.500 


254 


995 


450 


•12.1 


61,900 


255 


1690 


679 


-24 


44,600 


256 


994 


1006 


-12.1 


30.200 


257 


508 


464 


-274 


60,400 


256 


1517 


820 


•44 


37.800 



■uster ubIe of proteins in the nt liver daubase. showing spot master number.jel position {x«nd y). isoelectric point reliiive to CPK sundards. and 
predicted molecular mass (from (he sundard curve of Fit. 8)- 



924 



L Aadttioe tisL 



MSN 



^ CPKol SOSMW 



y CPKdI SOSMW 



2S0 
260 
261 
262 
263 
265 
266 
267 
268 
268 
270 
271 
272 
274 
275 
276 
277 
278 
278 
281 
282 
283 
284 
285 
286 
288 
289 
290 
291 
292 
293 
294 
295 
296 
297 
299 
300 
301 
302 
303 
304 
305 
306 
307 
306 
300 
310 
311 
312 
313 
314 
315 
318 
320 
321 
322 
323 
324 
325 
326 
327 
328 
330 
331 
332 
333 
334 
335 
336 
338 
339 
340 
341 
343 
344 



1796 
661 
1725 
486 
10S3 
1390 
510 
660 
430 
1044 
2019 
857 
895 
1292 
1350 
1670 



861 
1361 

67B 
1127 

172 

673 
437 
1038 
861 
806 
853 
422 



961 
878 
1848 
1505 
1313 
1314 
1332 
1277 
1391 
1147 
925 
787 
1462 
531 
860 
1162 
218 
1377 
913 
2012 
702 
494 
403 
1843 
1049 
1606 
1210 
1627 
1524 
1769 
1609 
266 
1902 
1316 
1341 
1104 
1480 
850 
1454 
670 
655 
1521 
1587 
1388 



712 
580 

loao 

538 
718 
570 
1084 
525 
1147 
829 
408 
652 
824 
578 
511 
1476 
816 
44S 
688 
609 
814 
97B 
1523 
667 
178 
1280 
1008 
158S 
583 
980 
016 
755 
892 
102B 
1451 
1408 . 
1365 
1395 
523 
1053 
1459 
603 
1484 
626 
101 
675 
677 



1608 
1566 
531 
784 
1050 
1593 
1616 
1854 
1265 
581 
1497 
1351 
1813 



1291 
751 
607 
471 
11S6 
407 
303 
598 
1004 
888 
585 
1047 
265 
540 



-1.1 
-20.4 
•2.0 
•28.0 
•10.9 
•6.3 
-27J 
-20.4 
•31.0 
-11J 
>0.0 
•15.0 
-14J 
-7.6 
-6.9 
•i6 
-1S.4 
-13.0 
-14.5 
-0.7 
-<.6 
-7J 
-7.3 
•7.1 
-7.8 
-6.3 

-e.5 

-13.6 
-16.6 
-5.1 
•26.3 
-14.9 
-6.3 
<-3S.O 
-6.5 
-13.9 
>0.0 
-19.0 
-28.1 
-32.6 
-0.7 
-11.1 
-3.3 
-8.5 
-3.0 
-44 
•1.5 
-3.3 
<-35.0 
■0.3 
-7J 
-7.0 
•10.1 
-4.9 
-15.1 
-5.3 
-20.0 
■20.6 
•4.4 
-3.6 
-6.3 
-30.0 
-3J 
-3.8 
-26.3 
-16.7 
-10.9 
-3.5 
-3.2 
-0.6 
-8.0 
•23.6 
-4.7 
-6.8 
-0.9 



31.800 
17.7D0 
44.800 
25.800 
177.400 
45.000 
63.400 
29.000 
31.900 
48.900 
36^100 
65^ 
31,700 
4Z900 
48,900 
27.100 
53,700 
42.600 
51,300 
27,300 
54.800 
25.100 
37.400 
67,200 
46.100 
37.600 
50.700 
55.000 
13.900 
37.800 
62.000 
43.600 
48.700 
38,000 
31,300 
12.400 
45.300 
168,200 
20.400 
30.100 
10.300 
49.800 
30.900 
33.700 
40.700 
34.700 
29.400 
14,700 
16,100 
17,600 
16.600 
54.900 
28.500 
14.400 
49,100 
13.300 
47.700 
420.500 
44.800 
44,700 
67,000 
20,100 
40,000 
43.700 
50,600 
24.700 
67J00 
88.500 
49.400 
30,300 
34.000 
50.300 
28,700 
102.200 
52.800 



345 
346 
347 
348 
340 
350 
351 
352 
353 
3S4 
355 
356 
357 
356 
350 
360 
361 
362 
363 
364 
365 
366 
367 
368 
369 
370 
371 
372 
373 
374 
375 
376 
377 
378 
379 
381 
382 
383 
384 
385 
366 
367 
386 
389 
300 
301 
392 
303 
304 
395 
396 
397 
399 
400 
401 
403 
404 
405 
406 
400 
410 
411 
412 
413 
415 
416 
417 
418 
419 
420 
421 
422 
423 
424 
425 



1006 
1005 
625 
361 
110 
521 
012 
1574 
061 
706 
1450 
1374 
474 
798 
764 
1384 
1713 
1161 
914 
412 
741 
878 
1560 
983 
434 
639 
1587 
1875 
1351 
1506 
1823 
254 
1409 
621 
1017 
953 
856 
12S2 
1699 
1042 
1490 
1554 
1193 
1374 
14S£ 
718 
1799 
1482 
1227 
1530 
1410 
912 
1465 
1473 
1029 
1516 
1495 
1525 
723 
650 
1501 
936 
350 
1033 
737 
1576 
646 
1695 
725 
1289 
1171 
599 
929 
739 
1400 



578 
640 

728 
983 
1343 
1130 
610 
530 
912 
762 
830 
1152 
807 
346 
338 
1066 
769 
859 
1156 
435 
486 
1503 
935 
520 
441 
610 
860 
762 
1059 
715 
532 
417 
583 
494 
595 

sae 

674 
256 
1516 
493 
563 
603 
404 
902 
969 
690 
732 
758 
1461 
577 
755 
256 
1063 
450 
1140 
754 
554 
1002 
252 
663 
478 
1057 
1120 
536 
425 
606 
496 
4S2 
770 
1041 
912 
162 
856 
625 
965 



-11.9 
-10.3 
-21.7 
-35.3 
<-35.0 
•26.7 
•13.9 
-3.7 
-12.9 
•18.9 
•5.3 
-6.5 
•28.7 
-16.3 
-17.3 
-6.4 
-2.1 
•9.3 
-13.8 
-32.0 
-17.0 
-14.6 
-3.9 
-124 
-31.0 
-21.2 
-3.6 
-0.5 
-6.fi 
-4.6 
-0.9 
<-35.0 
-6.1 
-21.8 
•11.7 
-13.1 
-15.0 

-e.1 

-2.3 
-11.2 
-4.7 
-« 0 
•6.9 
-6.5 
-5.2 
-18.5 
-1.1 
-4 8 
•64 
-4.3 
-6.0 
-13.9 
-50 
-4.9 
-11.5 
-4 4 
-4.7 
-4.3 
-184 
•20.8 
-4.6 
-13.4 
•35.B 
-114 
-18.0 
■3 7 
•21 C 
-2.3 
-18.3 
-7.7 
-9.1 
-22.8 
-13.6 
-17.9 
-4.7 



50.800 
46.800 
42.000 
31.100 
18,300 
25.700 
48.100 
54.300 
33.800 
40,400 
37.300 
24.000 
30.600 
77.800 
70.400 
27.000 
40.100 
36,100 
24,800 
63.700 
58.200 
13.000 
33.000 
55.200 
63,000 
48,700 
36,100 
40,400 
28.300 
42.700 
S4.200 
65,900 
50.400 
57.500 
49.600 
49.400 
44.S0O 
105,300 
12.500 
57.500 
50.400 
49.100 
67.700 
34.300 
31,700 
44,000 
41.900 
40.600 
14,400 
50.800 
40.800 
106 400 
28.100 
61.900 
25.300 
40.800 
52.500 
27.100 
108,000 
45,500 
59.000 
28.300 
26,000 
53.700 
64,900 
48,900 
57,300 
56.600 
40,000 
28.900 
33.900 
193.700 
36.200 
47.700 
31.800 




1 !*> 



426 
427 
428 
439 
430 
431 
432 
434 
435 
436 
437 
436 
439 
440 
441 
443 
446 
447 
448 
449 
450 
451 
452 
453 
454 
456 
457 
459 
460 
461 
462 
463 
464 
465 
466 
468 
469 
470 
471 
472 
473 
474 
475 
476 
477 
47B 
479 
480 
482 
463 
485 
486 
487 
486 
489 
490 
491 
492 
493 
494 
495 
406 
497 
499 
500 
501 
502 
503 
504 
505 
506 
507 
508 
509 
510 



t 

810 

1565 
1250 
1253 
734 
483 
518 
1020 
1122 
1870 
435 
86 
1740 
599 
743 
801 
1050 
1245 
1576 
1818 
1094 
1045 
1652 
1403 
1394 
905 
1038 
1598 
1528 
1098 
849 
1814 
1388 
1194 
577 
1140 
1797 
1293 
618 
2009 
1205 
1035 
160 
469 

1009 
1216 
816 
6S3 
1606 
47B 
1025 
1045 
1609 
775 
692 
1100 
1760 
882 
470 
494 
680 
1414 
1234 
1246 
824 
1246 
1115 
1189 
1578 
787 
979 
1153 
1730 



7D4 
843 

303 
847 

562 
1426 
433 
1041 
1170 
196 
673 
1102 
847 
544 
1571 
335 
668 
926 
1298 
1516 
1021 
440 
802 
8»4 
500 
718 
436 
581 
294 
863 
1137 
1125 
1072 
481 
1064 
467 
886 
524 
1133 
655 
299 
215 

7se 

155 
1370 
662 
540 
235 
346 
673 
1013 

607 
1186 
301 
1289 
178 
964 
776 
247 
1258 
1436 
852 
546 
1072 
659 
792 
1134 
1407 
391 
402 
250 
552 
619 
1006 



7.* 
•16.0 
•3.9 
-8.0 
-81 
-181 
■285 
•26.9 
-11.6 
■08 
-0.5 
-31.0 
<-35.0 
■1.8 
■22.8 
-17.8 
■16.2 
■11.1 
■8.2 
■3.7 
-0.9 
■10.3 
>0.0 
-2.8 
-6.1 
-6.3 
-14.0 
■11.3 
-3 4 
-4.3 
•10.2 
■15.2 
•0.9 
-6.3 
-8.9 
-23.9 
-9.6 
•1.1 
•7.6 
•21 .9 
>0.0 
■8.7 
-11 4 
<-35 0 
-28.9 
■22.8 
-11.8 
-86 
■15.9 
■19.3 
-3.3 
■28.6 
■11.5 
-11.2 
-3.3 
-17.0 
-19.3 
-10.2 
•1.6 
■14.5 
-28.9 
-28 1 
-12.5 
•60 
■83 
-8.2 
-15.7 
-6.2 
•99 
-8.9 
-3.7 
•166 
-12.5 
•94 
•20 



36.^ 

1S.S<t 
6a.9Et 
28.«ac 

147.gc( 

*isxt 

26.7CC 
36.eBC 
MJBS 
10.«X 
80.1B 
4SJK 
33.30C 

iseec 
29.ea: 

63,1K 
38.aK 

34.eoc 
S6gir 

4^800 
63.SK 
50.500 

9i.4ac 

35.9K 
25*1 
25.8ac 
27.a0B 
56.700 
27.300 
60.100 
34.900 
54,800 
25.500 
46.000 
89.900 
131.300 
39.3QC 
207,600 
17.400 
45.800 
53.500 
117.400 
77.800 
44.900 
30.000 
49.300 
48.800 
23.700 
89.200 
20.10D 
169,300 
31.800 
39.700 
110.700 
21J0D 
15,200 
36.400 
531100 
27.800 
4i70O 
39.00C 
25.SDC 
16200 
68l1« 

6&av 

f 00.(0 
52.«» 

4a.i« 



800 

(looe 

[l606 
948 
481 
i-1334 
868 
798 
822 
632 
(.1332 
803 
1-1190 
470 
768 
747 

hiro 

1502 
1728 
507 
870 
i1347 
(••1513 
308 

nasi 

1463 
It 000 
S25 
1164 
803 
1259 
856 
803 
1162 
128 
r1355 
805 
[1368 
802 
b1125 
70S 
1477 
980 
700 
.1028 
808 
788 
777 
980 
[1518 
k1212 
780 
818 
^1142 
S32 
771 

Lioea 

^822 
914 
1064 
1S24 
^1302 
982 
t1487 
758 
887 
930 
^1888 
842 
L"17 
65 
1014 
^732 
L1827 
[1000 



ia 1991. 12. 907-9)0 



D«ub«u of rat liytr proicini 



925 



Y CPKot SOSMW 



Sti BOO 4iM .ie.O S0.«O 

512 1099 533 -lOJ 54.100 

51J 1696 1034 .2J 29J00 

514 946 636 -13.2 47.100 

515 461 543 -28.5 53 400 

516 1334 1044 -7.1 26.600 

517 666 1021 .14.6 29.700 
516 796 779 -16.3 39.600 
519 822 670 -15.7 45100 
S!D 632 165 -21.5 189.000 

521 1332 S30 -7.1 37J00 

522 603 1104 -22.6 26 600 

523 1190 309 -6.9 86 600 

524 479 1226 -28.6 22.300 

525 768 1066 -17.2 28 000 
S2S 747 1016 -17.7 29.600 
5Z7 1170 231 -9.2 119600 
S2B 1S02 542 ^.6 53.400 
530 1728 520 -2.0 46 000 

532 507 1011 -27.4 X.OOO 

533 670 469 -14.7 57.900 

534 1347 1065 -6.9 27 300 

535 1513 346 -45 77 800 

536 306 654 <.35.0 46 000 
536 1851 669 -0.7 44 100 

539 1463 ge2 -5 1 3i loO 

540 909 561 -13.9 52.000 

541 625 289 -21.7 93 100 

542 1164 196 -9.2 146^00 

543 803 655 -16.2 45 900 
54* 1259 1143 -B.O 25 200 

545 656 1526 -15,0 12.200 

546 803 1071 -16.2 27 800 

547 1162 274 -9.3 98.400 

548 128 1321 <-35.0 19 000 

549 1355 1122 -6.8 a 900 

550 595 866 -23.0 35 800 
SB 1369 494 -6.6 57 500 
SO 992 405 -12.2 67.600 
55 1125 410 -9.8 66 900 
M 705 975 -16.9 31 400 
557 1477 1030 ^.9 29 300 
556 980 583 -12.5 50 400 
5a 700 1109 -19.1 26.400 
SSO 1028 621 -11.5 48!ooO 
562 898 794 -14.1 36 900 
564 789 1446 -16.6 14.900 
*S 777 766 -16.9 40^200 
S66 990 328 -125 81.900 
*7 1519 611 -4.4 46 600 

1212 661 ^.6 45600 

WO 760 594 -17.4 49700 

E'l 618 956 -21.9 3Z100 

"3 1142 771 -9.6 40 000 

57* 532 787 -26.2 39.300 

^ 771 250 -17.1 109200 

576 1066 534 -10.8 54.100 

^ 822 734 -15.7 41.800 

!2 914 754 -13.8 40.800 

5?« 1064 794 -10.8 38.900 

MO 1524 714 ^.4 42.800 

* 1392 783 -6.3 39.400 

982 686 -12 4 44JO0 

1467 672 -4.6 45.000 

5« 758 731 -17.4 41.900 

2! 687 1152 -195 24.900 

?2 930 523 -135 55.000 

S 774 -0.4 39.900 

?« 642 465 -21.1 58.300 

»0 1317 519 -7.3 55,300 

1^ 65 1 548 <-35.0 11.500 

S 614 -11.7 48.400 
^ 732 176 .18.1 17i300 

»» 1627 476 -3.0 59.000 

1009 1426 -11.8 15.500 



USN 



Y CPKol SOSMW 



596 619 269 -21.9 100 500 

587 1176 461 -9, eoTOO 

596 14&5 1044 -50 28600 

599 741 1188 -17.9 23 600 

600 907 402 -14.0 66 000 

601 667 656 -195 45800 
608 712 1136 -18.7 25400 

603 896 181 -14.1 165.200 

604 783 1461 -16.7 14.400 
606 736 223 -18.0 125 300 
606 629 273 -21.6 96 700 
»7 1064 286 -10.8 94 000 
606 663 503 -14.5 56 700 
606 2012 610 >0.0 46 700 
610 12S5 903 -.8.1 34 200 

612 1103 391 -10.1 69 600 

613 778 265 -16.9 102 000 

614 -824 518 -15.7 55 400 

615 1095 195 -103 149 100 

616 1 759 478 -1.6 seoOO 

617 994 372 -12.1 72 900 

618 751 374 -17.6 72.400 

619 1429 516 -5.7 55 300 

620 1 050 520 -ill 55 '200 

621 923 1105 -13 7 26 600 

622 1462 622 -5 1 47 900 

623 759 225 -17.4 1 24.000 

624 756 1038 -174 29 000 

625 1438 606 .5.5 48900 

626 1096 1089 -10 2 27 200 
527 942 548 -133 y'ooo 

628 809 621 -16.0 46.000 

629 899 979 -14. 1 31300 

630 1135 1321 -9.6 ig'lOO 

631 979 615 -12.5 48 300 

632 1542 1076 -4 1 27 600 

633 1345 814 -6.9 38*000 

634 409 950 -32.2 32400 

635 1165 704 -9.2 43 300 

636 774 604 -17.0 49 000 

637 1263 524 -8 0 54 800 

638 952 411 -13.1 66 700 

639 1717 575 -2.1 51 000 

640 994 292 -12.1 92 000 

641 165 1 224 <-35.0 22.400 

642 803 251 -16.2 lOe^SOO 

643 719 296 -18.5 90 700 

644 1100 294 -10.2 01 400 

645 534 1263 -26.1 21 000 

646 1153 1038 -9 4 29 000 

648 1246 204 -6.2 140.000 

649 14 1406 <-35.0 16.200 

650 1713 1049 -2.1 28 600 

651 1966 1183 >0 0 23,800 

652 1376 816 -6 5 36 000 

653 1442 1165 -55 24 400 

654 650 806 -20 8 38 400 

655 1111 551 .,0.0 52.700 

656 1095 861 -10.3 36 000 

657 1524 540 -44 53 600 
656 1777 860 -1 4 36 000 

659 391 564 -33 4 50 400 

660 977 565 -12.5 51 700 

661 656 166 -20.5 187,500 

662 732 312 -18 1 86 100 

663 1 787 567 -1.2 51 500 

664 888 268 -14,4 100 9CX5 

665 889 775 -14.3 39.'800 

666 715 221 -18.6 126.300 

667 781 227 -16,6 122 400 
666 646 165 -21.0 189^00 

669 1116 353 -9.9 76.300 

670 1362 643 -6.4 46 600 

671 547 789 -25.3 39;200 
673 964 746 -12 4 41.200 



MSN 



Y CPltol SOSMW 



674 1661 

675 1523 

676 706 

677 919 

678 lOBS 

679 600 

680 1237 
661 1103 

682 1406 

683 1596 

664 555 

665 1167 1313 
686 1932 790 



562 
642 
615 
SSI 
923 

loot 

283 
477 
249 



667 1545 

668 1456 

689 1011 

690 1995 

691 812 

692 1 1 54 

693 1993 

694 1628 

695 928 

696 1854 

697 1997 
696 957 
699 1 540 

702 577 

703 1610 

705 1278 

706 1641 



•2.7 62.100 

-4.4 51.800 

-18-8 46.700 

-13.7 48.300 

-10.5 52.700 

'22.7 33,400 

■6,3 M,300 

-10 1 95.100 

■6.1 59.100 

•34 109,800 

-24.8 43.500 

■9.2 19.300 

0 0 39.100 



707 1018 1386 

709 1074 1145 

710 293 889 

712 720 412 

713 1386 841 

714 1328 263 

715 698 433 

716 701 481 

717 1875 699 

718 575 702 

719 1216 204 

721 1069 464 

722 1 272 506 

723 956 822 

724 763 395 

725 720 916 

726 1476 415 

727 1646 473 

728 510 783 

729 1217 1126 

730 1856 724 

731 665 765 

733 1321 312 

734 719 427 

735 1101 473 

736 1359 569 

738 696 220 

739 687 409 

740 1205 256 

741 995 563 

742 896 596 

743 881 181 

744 1951 666 

745 726 168 

746 999 643 

748 182 1503 

749 2005 649 

750 1448 575 

751 792 266 

752 469 296 

754 664 254 

755 1195 164 

756 1821 1113 

757 909 246 
760 790 133 



619 -4,1 48.100 

764 .5.2 40.300 

953 -11.8 3Z300 

270 >O.0 100.200 

866 -16,0 34.900 

1461 -9 4 14 400 

819 >0.0 37^800 

656 -3.0 45.900 

254 -13.6 107.000 

715 -06 42.700 

345 >0.0 78.000 

563 -13.0 . 51.800 

730 -4.2 42.000 

900 -23.8 34,400 

562 .3.2 51.900 

571 -7.8 51.200 

704 -0.7 43.300 



•117 16.900 

•10.7 25100 

<-35.0 34.800 

-18.5 66.600 

-6 4 36.800 

-7.1 103.100 

•19.1 63.900 

■19.0 58.700 

■0.5 43.600 

-23.9 43.400 

-6.6 140.400 

-10.8 60,400 

-7.9 56.400 

■13.0 37,700 

■17.3 69.100 

-18.5 33.700 

-4.9 66.200 

-0.7 59,400 

-27.3 39.400 

-6 6 25.800 

•0-6 42,300 

-20.2 40.300 

■7.2 85.900 

-18,5 64.600 

-10,2 59.500 

-6.7 51.400 

-19.2 127.600 

-195 67,000 

■67 106.200 

■12,1 51,900 

-14 1 49.500 

-14.5 165.900 

>0.0 44,200 

-18.3 163,600 

-12.0 46.600 

<-35.0 13.000 

>0,0 46.300 

■5 4 51,000 

-16.5 101.900 

-2S.9 90.600 

-20.3 107.000 

-8.8 161.000 

-0.9 26.300 

-13.9 111,000 

-16.5 264,900 



926 



M9J 



Y CPKol SOSUW 
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MSN 



761 
763 
764 

765 
766 
767 
768 
768 
770 
771 
773 
775 
776 
777 
778 
77S 
780 
7S4 
785 
786 
787 
7B0 
781 
7B2 
783 
784 
786 
797 
798 
799 
800 
801 
802 
803 
804 
805 
806 
807 
808 
809 
810 
811 
812 
813 
814 
815 
816 
817 
818 
819 
820 
821 
822 
823 
824 
825 
826 
827 
828 
830 
831 
832 
833 
834 
837 
838 
839 
840 
841 
842 
843 



1388 
1416 

2Q20 
651 
1052 
1968 
1330 
1B7D 
857 
1337 
1576 
865 
1438 
1S39 
850 
700 
1052 
1413 

1364 

1822 
883 
616 
451 
777 

1536 

1461 
388 

1126 
933 

1420 

1759 
624 



1775 
573 
203 
960 
802 
625 

1851 



845 
B4« 
847 



1358 
851 
745 
2028 
1066 
629 
1376 
1771 
1045 
984 
1712 
1256 
1517 
1442 
1240 
1309 
2012 
937 
1342 
562 
1073 
481 
501 
751 
635 
1494 
1952 
1585 
571 
1325 
1727 
630 
2016 
673 



733 

loss 

568 
475 
1149 
468 
685 
613 
617 
974 
502 
824 
706 
458 
434 
411 
1136 
529 
885 
835 
382 



1429 
377 
1543 
807 
546 
212 
437 
533 
279 
865 
547 
1468 
196 
4S4 
1039 
308 
B27 
1015 
573 
24B 
393 
1246 
810 
645 
313 
1177 
790 
263 
362 
279 
205 
6S4 
449 
513 
1014 
708 
1405 
756 
826 
1039 
820 
581 
748 
833 
459 
301 
1080 
1312 
649 
301 
679 
905 
1200 



•6^ 
•5.S 
>0.0 
-20.8 
-11.1 
>0.0 
-7.1 
>0.0 
-15.0 
-7.0 
-3.7 
-12.8 
-5.5 
-4.2 
-15.1 
-19.1 
-11.1 
-6.0 
-6.7 
•0.9 
-14.3 
-22.0 
-29.8 
-16.9 
-4.2 
-5.1 
-33.6 
-98 
-13.5 
-59 
-1.6 
-21.7 
-14.2 
-1 4 
-24.0 
<-35.0 
-12.5 
-14.1 
-21 .7 
-0.7 
-30.9 
-68 
•15.1 
-17.8 
>0.0 
-104 
-21.6 
•6.5 
-14 
-11.2 
-12.4 
-2.2 
-6.1 
•4 4 
-5.5 
-8.3 
-7.4 
>0.0 
-13* 
-7.0 
-24.5 
-10.7 
-28.5 
-27.8 
-17.6 
-21 .3 
-4.7 
>0.0 
-3.6 
-24.1 
-7.2 
-2.0 
-21.5 
>0.0 
-18.9 



4^JtO0 
Z7XO 
51.400 
58.300 
25.000 
59.900 
44.300 
48.500 
48^ 
31^ 
56.700 
37.600 
43.100 
61.000 
63.800 
66.800 
2S.S00 
54.400 
35.000 
37.100 
68.500 
35.100 
15.400 
72.000 
11.700 
38.300 
53.100 
133.700 
63.400 
49 800 
96.500 
35,800 
53.000 
14.200 
148.400 
57.400 
29.000 
87.200 
37.500 
29,900 
51.100 
109.700 
69.400 
21.600 
38.200 
46.500 
85.700 
24.000 
39.100 
103.100 
74.600 
96.700 
139.200 
46.000 
62.000 
55.800 
29.900 
43.100 
16.200 
40.700 
37.500 
29.000 
37.800 
50.500 
41.100 
37.200 
60.900 
89.300 
27.500 
18.400 
46,300 
89.200 
44.600 
34.200 
23,200 



848 

848 

aso 

851 
852 
855 
856 
857 
aSB 
858 
860 
861 
862 
864 
865 



868 

869 

870 

871 

872 

673 

874 

875 

876 

877 

678 

879 

880 

881 

863 

884 

865 

886 

887 

888 

889' 

890 

891 

892 

894 

895 

896 

897 

896 

899 

900 

901 

903 

904 

905 

907 

90S 

910 

911 

913 

914 

916 

917 

819 

920 

821 

823 

824 

925 

926 

927 
928 
929 
831 
832 
833 
834 
836 
837 



1863 
1166 
1535 
1035 
834 
499 
1063 
887 
1446 
706 
1070 
472 
674 
1307 
645 
827 
685 
1807 
1323 
1228 
1904 
556 
1540 
1566 
1186 
1076 
1161 
547 
1756 
1543 
1432 
822 
1103 
1501 
796 
636 
951 
717 
1123 
891 
1245 
1962 
1322 
420 
662 
845 
624 
931 
799 
765 
775 
888 
828 
681 
1544 
1606 
1237 
1442 
1260 
764 
1133 
1123 
829 
1131 
1441 
679 
1487 
1082 
1231 
1609 
810 
965 
947 
865 
1421 



271 
523 
1024 
826 
542 
220 
194 
890 
639 
311 
1066 
347 
480 
499 
887 
10O4 
494 
402 
783 
1031 
346 
647 
756 
777 
351 
720 
1111 
757 
594 
278 
890 
689 
414 
607 
1103 
634 
759 
548 
229 
413 
234 
346 
626 
570 
428 
243 
703 
1094 
229 
520 
889 
824 
1303 
1544 
301 
387 
686 
749 
367 
1541 
1123 
380 
242 
318 
874 
219 
1191 
775 
816 
670 
900 
520 
462 
843 
1056 



•0.6 
•9.2 
■4.2 
-11.4 
-15.5 
-27.8 
-10.9 
-14.4 
■5.4 
-18.9 
-10.7 
-28.8 
•19.9 
-7.4 
-21.0 
-15.6 
-19.5 
-1.0 
-7.2 
-8.4 
-0.3 
-24.8 
-4.2 
-3.8 
-8.6 
-10.6 
•9.3 
-20.9 
-1.6 
-4 1 
-5.7 
•13.7 
-10.1 
-4.6 
-16.3 
-21 .3 
-13.1 
-18.6 
-9.8 
■14.3 
■6.2 
>0.0 
•7.2 
•31.4 
•20.3 
•15.3 
-21.7 
•13.5 
•16.3 
•17.2 
•17.0 
•14 4 
•15 6 
•19.7 
-4.1 
-3.3 
-8.3 
•5.5 
-8.0 
-17.3 
-9.7 
-9.8 
-15.6 
-9 7 
-5.5 
-19.7 
-4.8 
-10.5 
•6.4 
•3.3 
•16.0 
•12.8 
-13.2 
-14.8 
-5.9 



88.500 
54.800 
29.600 
37.500 
53.400 
127.100 
150.500 
34.800 
46.800 
86.200 
28.000 
77.600 
58.800 
57.000 
34.900 
30.300 
57.400 
68.000 
39,400 
29.300 
77.700 
46.400 
40,700 
39.700 
76.800 
42,500 
26.400 
40.700 
49,700 
97,100 
34,800 
44,100 
66,400 
46,900 
26.600 
47.200 
40,600 
52.900 
121.200 
66.400 
117.800 
77.700 
47.700 
51.300 
64.500 
113,000 
43,400 
27.000 
121,000 
55.200 
34,800 
37.600 
19.700 
11.700 
89.100 
70.400 
44,100 
41,100 
73,700 
11,700 
25,900 
71,500 
113.200 
84.300 
35.400 
128,200 
23.500 
39,800 
38.000 
45,100 
34,400 
55.100 
60.600 
36.800 
28.400 



839 
841 
942 
943 
944 
945 
946 
947 
948 
849 
950 
851 
852 
854 
855 
857 
8SS 
860 
861 
862 
863 
964 
965 
966 
967 
968 
969 
970 
971 
972 
974 
975 
976 
977 
978 
979 
980 
961 

963 

964 

965 

987 

968 

990 

991 

992 

993 

994 

995 

996 

997 



1197 
1765 
602 
312 
893 
1300 
630 
187 
1380 
1766 
1038 
860 
957 
503 
1838 
1010 
768 
596 
557 
867 
564 



999 
1000 
1001 
1002 
1003 
1006 
1007 
1009 
1010 
1011 
1012 
1013 
1014 
1015 
1016 
1017 
1018 
1020 
1021 
1022 
1023 
1024 
1025 



671 
1204 
910 
609 
1285 
822 
976 
403 
279 
844 
1124 
994 
1612 
749 
1064 
1197 
1752 
1344 
1024 
739 
816 
785 
1159 
1090 
1030 
647 
902 
686 
1815 
1205 
617 
968 
970 
1736 
643 
822 
875 
291 
1386 
459 
679 
1816 
1032 
1629 
1311 
1722 
1015 
1574 
781 
1129 
812 
785 
1290 



827 
885 
472 
496 
491 
269 
423 
736 
344 
665 
193 
152 
701 
547 
712 
816 
174 
419 
409 
320 
334 
1155 
2SS 
796 
154 
1046 
206 
232 
437 
567 
495 
961 
295 
664 
642 
1141 
642 
911 
1506 
317 
1106 
1159 
555 
361 
317 
928 
701 
811 
461 
847 
579 
504 
289 
290 
771 
478 
1164 
487 
279 
644 
745 
S41 

661 
1128 

634 

994 
1134 

424 

743 
1219 

464 
83 

317 

44e 

739 




•8.8 
•1.5 
•22.7 
<-35.0 
•12 1 
•7.5 
■21 6 
<-350 
-65 
•1.5 
-11.3 
-14.9 
-13.0 
-27.6 
>0.0 
•11.8 
-17.2 
-23 0 
-24 8 
-14 4 
•24.5 
•12.8 
-20.0 
•6.7 
-13.9 
•22.3 
•7.7 
-15.8 
•12.6 
■32 6 
<-35.0 
■15.3 
-9.8 
•12.1 
•3.2 
•17.7 
-10.8 
-8.6 
-1.6 
-6.9 
-11.5 
•17.9 
-15.9 
-167 
•9.3 
-104 
-11.5 
-15.2 
-14.1 
-14 4 
•0.9 
-8.7 
-22 0 
•12 8 
•12.7 
■1.9 
•21.1 
-15.8 
• 14.6 
c-35.0 
-6 4 
•29 4 
•19.7 
-0,9 
•11 4 
-30 
-74 
•20 
•11.7 
•3.7 
•168 
-9.7 
•15.9 
•16 7 
-7.7 



37.S0C 
3S.0CO 
M.60C 
57.1tc 
57.715 

'00.30C 
85.10C 

«'.ect 

78.206 
45.40C 
151.00C 
213.000 

43,400 
53.0C0 
42.90C 
37,900 
174,900 
65.70C 
67.10C 
83.900 

eo.soc 

24.800 
106.600 
38.700 
210.30C 
28,700 
138.900 
119.300 
63 40C 
5) .600 
S7.40C 
31.200 
91.100 
45 400 
46.700 
25.300 
46.700 
33.900 
12.800 
84.700 
26.600 
24 600 
52 400 
74.900 
84.500 
33.300 
43 400 
38.200 
60.700 
36.600 
50.700 
56.500 

63.100 

92.700 

40.000 

58.900 

23.700 

58.100 

96.400 

46.600 

41,200 

53.S00 

45.600 

25.800 

47.200 

30.700 

25.S00 

65.000 

41.300 

22.SS 

58.400 

591,** 
»4.eOD 
62.400 
41.SB0 



D*ub*M of ni Incr proiemt 



927 



Y CPKtf 



1DSS 
1027 
tOS 
iCaO 1284 
1081 W6 



40S 

i2se 

856 



10B2 
1083 
lOM 

toss 

1036 
1(B9 
1040 
10*1 
\0U 
1(M5 
10<7 
1048 



1547 
1381 
1S2S 
112B 
1226 
1761 
S41 
818 
1036 
1439 
1540 
1576 



104S 1089 
1060 »49 



1061 
1062 
1063 
1064 
1065 
1056 
1068 
1060 
1061 
1062 
1064 
1065 



426 
1583 

779 
1613 
1380 

2S4 
1261 

3B3 
1817 
1245 
1258 

705 



1066 1181 

1067 529 



1068 
1069 
1071 
1073 
1075 
1076 
1078 
1081 
1083 
1085 
1090 
1082 
I0B3 
1094 
1005 

tooe 

1090 
1101 
1102 
1103 

105 
i106 

107 

iioe 
nil 
^1^ 

■lis 

116 

ni7 

116 
Ills 

'120 

■121 
122 
123 

!125 

126 
128 
133 
139 
147 
148 



508 
1898 
873 
1768 
836 
1863 
826 
971 
1697 
1157 
620 
1867 
2019 
1546 
1545 
61 
1954 
588 
1050 
457 
1884 
1714 
1717 
1976 
547 
1348 
1385 
1078 
975 
1202 
1022 
1905 
1512 
1114 
1464 
1048 
1122 
1722 
1098 . 
1830 
764 
1968 



SS2 
848 

547 

226 
822 
403 

551 
496 

645 
274 

262 
839 
910 
485 
407 
250 
635 
411 
1040 
818 
1385 
1092 
620 
377 
663 
746 
605 
645 
746 
792 
934 
734 
658 
696 
604 
609 
1128 
773 
861 
566 
483 
202 
794 
910 
597 
894 
538 
477 
935 
237 
1048 
667 
797 
532 
649 
546 
722 
1066 
621 
762 
816 
787 
B33 
1076 
616 
1301 
677 
452 
857 
802 
892 
B2S 
569 
1182 
724 



-7^ 
-15.0 
-7.7 
-12 J 
-4.1 
•64 
-4J 
■9.7 
•8.5 
•1.6 
-25.7 
-15.8 
-11.3 
-5.5 
-4.2 
•3.7 
•10.4 
•13.2 
•31.1 
•3.6 
-16.8 
•3^ 
•6.5 

<-a5.o 

-8.0 
-33.3 

•0.9 

•8.2 

•8.1 
-189 

-8.0 
-26.3 
-27.4 

-0.3 
-14.7 

•1.5 
•15.4 

•0.6 
-15.7 
•12.7 

•2.3 

■9 4 
■21 .9 

■0.5 
>0.0 

-4.1 

■4.1 
<-35.0 
>0.0 
-23.3 
-11.1 
-29.5 

■04 

■2.1 
■2.1 
>O.0 
-25.3 
•6.9 
•6.4 
•10.6 
-12.6 
-8.7 
-11.6 
-0.3 
-4.5 
-9.9 . 
-5.1 
-11.1 
•9.8 
-2.1 
•10.2 
•0.8 
-17 J 
>0.0 
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X 


Y 
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&2.600 


1153 


991 

•C f 


11 u 


-13.7 


24.700 


36.500 


1 194 






•3.5 


35.900 


S3.000 


1 IB 1 




*oo 


-21.3 


68.400 




lis 


623 


397 


-21.8 


68.800 




1163 


6£5 


397 


•20.2 


66.700 


O/.vuu 


1168 


564 


528 


-24.4 


54.500 




1170 


552 


529 


-25.0 


54.500 




1171 


538 


524 


-25.9 


54.800 




1172 


545 


514 


-25.5 


55.700 


Qfi W\ 


• 174 


1099 


522 


•10.2 


55.000 
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Tible 4. Computed pr t of *ome tnewn proteiw reUted to measured CPK pfs 



Protein Name 

0 Creatine phospfio kinase (CPK). rafibil musde 

1 Fatty aocl-binding protein, rai hepatic 

2 b2-microglobulin. human 

3 Carbamoyl-phosohaie s/ntnase. rat 

A Prealbumin ( serum albumin precursor), rat 

5 Serum albumin, ras 

6 Superoxid dismuiase (Cu-Zn. SOD), rat 

7 Phospholipase C, phophoinositjoe-speciflc (?). rat 
B Albumin, human 

9 Ape A-l lipoprotein, rat 

10 proApe A-l lipoprotein, human 

n NADPH cytochrome P-450 reductase, rat 

12 Retinol binding protein, human 

1 3 Actin beta, rat 

14 Actin gamma, ra: 

15 Ape A-l lipoprotein, human 

1 6 Apo A-IV lipoprotein, human 

1 7 Tubulin alpha, rat 

18 F 1 ATPase beta, bovine 

19 Tubulin beta, pig 

20 Protein disulphide isomerase (PDI), rat hepatic 

21 Cytochrome b5. rat 

22 Ape C-ll iiooorotein. human 

Amino aad pi assumea in calulation: 
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l-D Otu6«w of rai ti«cr proicini 1977 

An updated two-dimensional gel database of rat liver 
proteins usehil in gene regulation and drug effect 
studies 

We have improved upon the reference two-dimensional (2-D) electrophoretic 
map of rat liver proteins originally published in 1991 (K. L. Anderson et al.. 
Electrophoresis 1991, 12, 907-930). A total of 53 proteins (102 spots) are now 
identified, many by microsequencing. In most cases, spots cut from wet, Coo- 
massie Blue suined'2-D gels were submitted to internal tryptic digestion [2 J. 
and individual peptides, separated by high-performance liquid chromatographv 
(HFLC). were sequenced using a Perkin-Elmer 477A sequenator. Additional 
spots were identified using specific antibodies. 



Figure 1 shows the current annouted 2-D map of F344 
rat liver, analyzed using the Iso-DALT system (20 X 25 
cm gels) and BDH 4—8 carrier ampholnes. Both the 
map itself and the master spot number system remain 
the same as shown in the original publication. Table 1 
lists the important features of each identification shown, 
including the gel position, p/. and M, for the most 
abundant or most basic form of each protein. Using this 
extended base of identified spots, a series of four 
improved calibration functions has been derived for the 
p/and SDS-iV, axes (the first two of which are shown in 
Fig. 2A and B). Both forward and reverse functions are 
derived, so that one can compute the physical properties 
of a spot with a given gel location, or inversely compute 
the gel position expected for a protein having given 
physical properties: 



^RaTLIXER ~ /w— RaTLIVER ■>. '■^^:SEOUEVCE DERn-£D) 
•^KATtlVER ~ ffi-MXWlX X (PAeOLESCE-DEH\Eo) 
^rOEL-DEmvED = /nATinTH Y-Mr f ^HATLIVEIt) 
-OERIVIO ~ AaTUN-ER X-»I (■^IIaTU%Tr) 



(1) 

(2) 
(3) 
(4) 



A spreadsheet program (in Microsoft Excel) was devel- 
oped to facilitate flexible computation of p/'s from 
amino acid sequence data, and the results were entered 
into a relational database (Microsoft Access). A table of 
spot positions and sequence-derived pi's and M's was 
fitted with a large series of analytic equations using 
Tablecurvc (Jandcl Scientific), and the four conversion 
Eqs. (1M4), relating computed p/ and gel X coordinate, 
or computed molecular weight and gel Y coordinate, 
were selected, based on criteria of simplicity, goodness 
of fit and favorable asymptotic behavior. Table 2 lists the 
equations and coefficients. Application of Eqs. (3) and 
(4) to a spot's A" and K coordinates, given in [1], produce 
improved M, estimates, and allow compuution of p/ 

Coirnpendencc: Or. Leigh Anderson. Lar(e Scale Biolosy Corpora- 
tion. 9620 Medical Center Drive. Rockville. MD 20gS0-333S USA (Tel: 
+301-424-5989; Fax: +30I-76JU892; email: leijheifbe.com) 

Keyword!; Two-dimensional polxacrjrUmide gel electrophoresis / Liver 
/ Map / Identincalioo / Calibration 
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directly in pH units, instead of in terms of positions rela- 
tive to creatine phosphokinase (CPK) charge standards. 
The inverse Eqs. (1) and (2) were used to compute the 
gel positions of a series of p/ and M, tick marks. These 
tick marks were plotted with SigmaPlot (Jandel), 
together with fiducial marks locating several prominent 
spots, and the resulting graphic was aligned over the syn- 
thetic gel image (computed by Kepler from the master 
gel pattern) using Freelance (Lotus Development). Maps 
were printed as Postscript output from Freelance, either 
in black and white (as shown here) or in color, where 
label color indicates subcellular location (available from 
the first author upon request). We have also used the rat 
liver 2-D pattern as presented here to calibrate the pat- 
terns of other samples. Using mixtures of rat liver and 
mouse liver samples, for example, we made composite 
2-D patterns that allow use of the rat pattern to standar- 
dize both axes of the mouse pattern. This was accompli- 
shed by deriving transformations relating the fat and 
mouse X, and separately the rat and mouse Y, axes 
(Table 2, lower half; Fig. 2C and D) based on a series of 
spots that coelectrophorese in these closely related spe- 
cies. These functions were then applied to derive equa- 
tions relating the mouse liver jf and Xto p/and SDS-A/, 
(Eqs. 5 and 6 below). The resulting standardized 2-D pat- 
tern for B6C3F1 mouse liver is shown in Fig. 3. 

^rMOL'SE LIVER ~ yilATLIVER V-Mr C/mOUS t LIVER >-»TLIVE« V 

( ^mouselpver)) (5) 



MOUSEUVER 



= A 



RJ,T LIVER X-pl (AtOUSELIVER X-RATLIVER X 



MOUSE LIVE 



r)) 



(6) 



A slightly more complex approach can be used to stand- 
ardize samples that have few or no spots co-electropho- 
resing with rat liver proteins. In this case, a 2-D gel is 
prepared with a mixture of the two samples, and four 
functions (forward and backward, each for X and Y) are 
derived relating each sample's own master pattern to the 
composite. The required functions are then applied in a 
nested fashion to yield the desired result (using rat 
plasma as an example): 

^rRATPLASMA ~ AaTLIVER V-M, C/iuT PLASMA, LIVER Y-RATUVER Y 

C/iuTPLASMA Y-KAT PLASMA* UVER Y ( ^RATPLASMa))) 

<7) 
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445.64 66 354 


1317.72 5.91 
1730.72 6J4 
1547.96 6.14 
665 J3 5.01 


589.03 49 602 
756.02 37 819 
849.08 33 186 
397.39 74 564 


1996.60 6.72 
23.05 4.03 


1017.03 26 887 
1433.25 17 419 


310.59 4J4 


433.80 68 206 



Bltexffkitiu \fti. It. i»n-ini 
Takk 1. continued 



J-D Daubut of ru Imr proteitu I979 



MSN" 



Protein IDb) Protein ume 



Identification eomisenu 



Gel X" Experimemil Gel r" Expenmenul 



1 iM, J 110, 


CrSM.RAT 


1 1 J 1 IIS 




5, 1<7. 157 




M. 61 


CATA.RAT 


1 lit 


C0X2.RAT 


n 


CYBS.RAT 


4] 


CK-RAT' 




CK-RAP' 


< 11 


cNPL-RAT^ 


60 


enoa.rat 


17 


ER60.RAT 


17 


atpb.rat 


196 


ATP7.RAT 


79 


FIAP Rat 


62. 78 


DHE3.RAT 


12S 


HAST-RAT" 


307 


HOl.RAT 


413. IMO, 


HMCS.RAT 


933 




133, 144. 235 


HMCS.RAT 


8. 23. 1307 


HS7C.R.<T 


15. 25. no 


P60.RAT 



971 HS75-RAr' 
1216, 1215, 90 HS90-RAr' 
2i6 INCI-HUMAN 

415, 734 



SO 

227 

134 

It. 35, 226 

175, 251 
1I6S, 1170, 
1171 
47, 93 

236 
320 

152 

1179. JIM, 
lltl. 1182. 
1113 
55, 103 

135 

172 

277. 56 
SO, 1225 
1224 



LAMB-RAT" 

LAMR-RAr' 
FABL.RAT 

MDHCMOUS 
E 

GR7S-RA-r' 

NCPR.RAT 
PDI.RAT 

ALBU.RAT 

APAl.RAT 
IPKLB0V7N 

PNPH.MOUSE 

pyvc-RAT' 



Cartuaiyl phmphaie 
ijDtluse 

Cauliu 

COX-II 

Cvtodireme B5 

Cyukeniis 
Criokentin 
Endoplasmin 
Esolase A 
ER-60 

Fl ATPue e 
Fl ATPase 6 

Fmciose- 1 .6-bii-plsDspbause 

Gluuouu debydrofenuc 
HA5T-I: N-bydroxyuyl- 
imine sulfcuinsferuc 
Heme oiytenue 1 

HMC CoA syntluse, 

eytosolic 
HMG CoA syntbase. 

mitoeboodnal (!nt) 
HSC-70 

HSP-60 

HSP-70 
HSP-90 

loterferoo-Y induced 
prouis 

laminin receptor' 
L-F.aBP (liver fatty acid 

biadint protein) 
Malaie debydraienase 

MiieooJ; crpTS 

NAOPH P450 reductase 
POl: Protein disulfide 

isoffleme 
Pio-AJbumia 

Pto-APO A-1 lipoprotein 
Protein kinase C inhibitor 1 

Purine nucleoside 

pbospboiyiaK 
Pyruvate caiboxylase 



2-D of pure prouin; eomlimied by 
Mtenninal lequeoce and AAA 



1453^6 6.05 



Henslee. J£C, 1979): reported m Biochim. 
Biophys. Acta 1022. 115-125 



SKOO.RAT 


SMP-30: ScDeseaee 
marker proieis-30 


Interna) sequence 


721.71 


5.11 


SODC.RAT 


Superoxide dismutaae 


AAA; comfirmed by internal sequence 


1161J4 


5.74 


TPM-RAr' 




(R. M. Van Frank) 






Tm: tropomyosin 


Location in cytotkeletoo. 2-D posiiioc 


476.24 


4.66 


TBAI.RAT 




relative to human. Ab 






Tubulin 0 


Positional bomoloiy with human tnroujb 


688.22 


5.06 


TBBIJIAT 




coelectropboresis, cyioskeletal locaiioo 






Tubulin i 


Positional bomolo(y with human tbrouch 


621J9 


4.93 


VIMEJIAT 




coelectropboresis, cytoskeletal locattos 






Vuneottn 


Positenal bemolocy with human through 
coeleetrophoresii, cytoskeletal location 


673.00 


5.03 



181.64 160 640 



Inienul sequence 


2000.tl 


6.73 


499.64 


58 968 


Ab (J. W. Taanman), eonfimied by 


452J7 


4.61 


1062.67 


25 504 


interna) sequence 










2-D of pure protein; Ab; conTiimed 


J 1 J.DB 


i Tt 




18 493 


by AAA 








Location in cytoskeletal fraction 


1165.12 


5.75 


569.09 


51 448 


Location in cytoskeletal fraction 


743.11 


5.15 


605.23 


48 ir 


Ab (F. Wiizmann) 


567.73 


4.83 


263J7 


112 194 


Internal sequence and AAA 


1399.78 


6.00 


623 34 


46 674 


A^-Termina) sequence (R. M. Van Frank) 


1184 JO 


5.77 


523.51 


56.169 


A^-Terminal sequence and AAA 


629.06 


4.95 


588.83 


49 620 


Internal sequence 


1227 J4 


5.82 


1184.65 


22 310 


Uncertain; by comparison with ID in 


924.54 


5.44 


737.77 


38 858 


Carrisoa and Water fJBC 257 I3n5.ni^ti 










A-Tenninal sequence and internal sequence 


1887J9 


6J5 


566.92 


51 655 


Internal sequence 


1297.94 


5.89 


86135 


32 638 


Uncertain; available dau from inieraaJ 


1219J9 


5.81 


915.71 


30 423 


sequence 










Ab (J. Cetmersbausen) 


1033.48 


5.59 


538.13 


54 571 


Ab (J. Germenbausen), A'-termina) 


666.40 


5.02 


1019.42 


26 811 


sequence (Sieiner/Loiupeicfa) 








Positiona! homoloiy (»itb human, etc.) 


811.87 


5.27 


425.76 


69 521 


tbrDUfb coeieeiropboresis 










Ab (F. WitzsBui); confirmed bv N-ierminaJ 


845.09 




on m 


3D >0 1 


sequence and AAA 










Ab (F. Witzmin) 


976.11 


5.51 


4r.i4 


67 674 


Ab (F. Wiizmin) 


659.86 


5.00 


329 


90 107 


Internal sequence 


993.85 


534 


1006.04 


27 237 


Positional homology with human through 


737.10 


5.14 


425.19 


69 615 


coelectropboresis, nuclear location 










Internal sequeoce 


534.02 


4.77 


697.62 


41 327 


Ab (N. f»1 Bass) 


1586.09 


6.18 


14S3 43 


16^22 


Internal sequence 


1270.85 


5.86 


861.96 


32 620 


Positional homoloiy *>th human through 


905.67 


5.41 


413.67 


71 589 


coelearopboresis 










2-D of pure protein 


824.69 


5.29 


393.21 


75 366 


//-Terminal sequence (K. M. van Frank), Ab 


564J0 


4.83 


528,47 


55 618 


Microsomal lumen location, p/, U, relauvc 


1391.03 


5.99 


446.68 


66 195 


to albumin 










Coelectropboresis vith plasma proieic 


920.41 


543 


113731 


23 467 


Internal sequeoce; homology with bovmc 


1480.01 


6.08 


1458.81 


17 007 


protein 










internal sequence 


1507.19 


6.10 


911.16 


30 599 


Tentative; 2-D of pure protein (J. G. 


1485.10 


6.08 


22332 


131 589 



830.10 3 4 051 

1388.68 18 173 

957.86 28 865 

537.67 54 620 

535.4 8 54 855 

53930 54 426 



1980 



N. I_ 



lUk 1. GOBtianed 
MSN" h«ieaID6) 



Pro Lac uae 



"BBPL.RAT 



Identifiotioo comments 
loiereal lequenec 



Gel X" Expemnenul Gel Y" Ejpenmenul 
610.43 42 469 



119 US 5.7J 



DOi in Mquenee 

2} U>a moiphine-bmdini Internal sequence 773 j] j ,q 

protein 

iaiter spot ouaber (MSS) from 11] 
<^PROT identifier 

^^d^^Tef Jf' «»« h«ie or most .bund^t «,i,ned .pot on the FJ44 m„ter gel p.tten, 

»iMPR?T s!,.?V:;Ld"LTifieT" "'"'"^ ^""«— -nCude. here 

cviaiions: AAA. amioo acid analysii; Ab. antibody 

2. Equations and eoefCcienu 



lin.41 22 363 



Equauon (0 
y - f(compute< .t/,i « - - ttxjx-x/c) 
X - neomputed p/, y • e tx - a/lax 
iei M, - fitii f:l )0 v - « ^ bxe 
sad p; - nni rel -n y - c -Hx cx' * dx' lux 



ilx 



0.98ilgl02r 
e/jJ-i 0.99247216 
0.9960 IT? 
■ ot^ 0.99176499 



178.74803 
-86jS66SJ 
-8464J809 

4.044686 



1967.7892 
-904497.94 

19095881 
-0.00114238 



32363.958 
3856926.1 
-0.9086255 
0.0000323 



gel r- nni gel Y) 

eel X - flrai gel JT) 
y - flmouse (cI Y) 
X - ftmouse (el X) 



y a bx + or'-' + rfo*-* lax + 

r - o + Ax' inr + ci" + di^ 
y a * bx' liix * ex" + rfx^ 
.> - o * ix ct^ Inx ••• ax'-' + 



0.99951069 11861.44 678.91666 

0.99926349 58.935923 0.0009 13S3 

0.99950032 69.740526 0.00050772 

0.9992832 -198.07189 2.0899063 



-0.78964914 
-0.000213688 
-0.000130392 
-0.000671191 



18276844 
-0.00000455 



1567 J639 
0.00000159 
0.00000116 
0.000145189 



-27154534 
0.00000000176 

-6953.9592 

-0.000000986 



y=a+bx-K:x/lnx+d/x+e/x'Xl .5) 




B 



y=a+bexp(-x/c) 



> 

"5 




SOOOO 100000 
computed MW 



150000 



y=a+bx+cx'>2lnx+dx'^2.5)+ex'^3 



2500 




D 



y=a+bx'^21nx+cx'^(2.5)+dx'^3 



1000 2000 
_B6C3F1MST2_X 



3000 





1750- 




1500- 




1250- 


> 


1 






1000- 


in 




S 






750- 


n 




u. 






500- 




250- 




0. 




0 



500 



1000 1500 
B6C3F1MST2.Y 



2000 



2500 



i^eni?a^,!j^. T ;'"-r^^^ (continuous cut^es) to dau on identified pro.e.ns (square symbols). (A) p/ cort,puted from 

Hi rL^Mfo?^ S^Ff»o!L H^^^^^ v*"^^^ mouse liver ver,., X position in F3443 ra, I.ver. for coe.ectrophoresin, spots; (D) 
wJ'Z ^S^^ a^S^r '"'"'^Phorestn, spou. In each case, inverse equaiion 



J-D Ouibasc of rai Ur,t proiemt 1981 



[ B6C3F1 MOUSE IJVER2-D PROTEIN PATTERN ^"^^^^^^^^^^T "] 



u 

I I 



Efibopla 



KAOPH Cyl P.«Sa 



c 

2 ** 

"a 



3 

<J 

-Si I 
O 



— MB 

5 •■ — w 




Mitocnondna Miaoaomo 
reuumine Cyt oih i U l o ii 
Nude Plmvi Cvwl . 



Ttnutat 



I 



I 

la 



I I 



I I I 




F.gvrr J MiSltr 2-D gel paurm for B6C3F1 mouse liver, siandard.zed usmg the F3« rat liver paiicrn ideniir.caiions. according lo the method 
aescnoed in the text. Twenty-nine proteins are ideniined. 



P^RATfLASMA = /»ATUVlll X-pl (/ilATrLASMA.LIVEX X-HAT LIVER X 

0»ATHAIMA X-RATFLASMA.trvSR X (-^RAT TLASMa))) 

(8) 

This unified approach, in which one well-populated 2-D 
pattern is used to standardize a family of other patterns, 
has the additional advantage that the resulting p/ and M, 
scales are directly compatible. Hence one can compare 
the relative p/s of mouse and rat versions of a se- 
quenced protein in a consistent p/ measurement system, 
and select likely inter-species analogs based on posi- 
tional relationships on common scales. Adoption of 
immobilized pH gradient (IPG) technology [4—7] will 
result in . substantial improvements in p/ positional 
reproducibility for standard 2-D maps such as those pre- 
sented here; however, we believe that our approach will 
continue to be useful in establishing the empirical pH 
gradient actually achieved by such gels under given 
experimental conditions (temperature, urea concentra- 
tion, etc), in relating patterns run on different IPG 
ranges and using different lots of IPG gels (between 
which some variation will persist). Development of 
rodent organ maps is a continuing effort in our laborato- 
ries (8-10), and results in regular additions of identified 
proteins. Those who wish to receive current rodent liver 
maps, with color annotations, should send a sumped 
self-addressed envelope to the first author. 



H'e would like to thank the individuals who provided ami- 
bodies mentioned in Table 1. and R. M. van Frank for un- 
published sequenced data. 

Received May 31, 1995 
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Progress mth Proteome Proie'-ts. Wi, „ 
Proteins Expressed bv a r? 
Identified an'd How t Do r^"' 

-NO KEITH L. WILLIAMS- "^"™'- ^^'^ F- HOCHSTR^j^^f ^ ' 

Geneve U. S^nizer/and "^'^'^ t^«" rrj/n- nf Geneva. CH 12 1 1 

Introduction 

o^::T:^i^^^^^ t. ... 

compl.t. nucl.oi.dc sequence for bactc opl,aLT,r""'^^ ^'"'^'^^"on of .lie 
scquenc=ofaneu.aoo.cchro.oson,:Xr^^^^^^^ 

" ihe definmon of alJ open readme fram-c of! 'Senear future will 

>-"-'"^^ ele.ans ^n6Arah,Jnps:sMana%,^^^^^^^^ Car.."! 

'•re no. an end m themslevev In fact (hev onK r ' ■^^q""c.nenroiecK 
■np .he funcuon of an or,an.m. A " ea'cha^^^^^^^^^^^^ 

^o-=xpre.sMon of thousands of een^s can b. i"^! ""^ how ,h, 

^, vFr- vsion dcrinc an orsjn- 

■ Ciin-ejpond.ng Author — 

^^^^^^^^^^^^^ 



20 Marc R. U'ilkins er a!. 

ideniify all cDNA species, imd the approach doe. oot e«iiv 

..creeninE. Analys,. of gene expression by the smdv of pr-^e tZn^T'T' 

t.-ue present, a favorable al,ema»ve. Thr. can be ^^iieveS ^ 

.-D, lel elecrrophores,s. quannianve computer ,ma,e an^C. s a^^^^^^^^^^ 

canon technique, to create 'reference maps' of all d-tectabl-'Dmi-.n. iT f 

map. establish patterns of norma] andZon^l «„e '^^^^^^^^^ "'^ 

allow the examination of some post-ir^S^L^? ^ ^ r ""^ 

functionally important for manvprote nT T^^^^^^ mod.fic.t.ons wh.ch are 

cally from reference map. to es'tabl.sh S!eir idcS " '"'""^ ^> 

To define protcan-ba«d ecnc expression analysis, the conc-m of ,h. • 
vva. recently proposed fWilkinse;a/.. 1995; Wasineerw,// 199? / P^°'^°'"^" 
enttre PROTetn complement expre....ed bv a genOME orV"- a ceH o^^"''""'' 
concept of the proteose has some differencef fro.tat oHI^^^ 
1. only one definnive senome of an orpanicm - ' "''^ 

as protein product, „pr«.«d by jhemaiive „„. ,plicin. o7t ,h /ir? 
.ran.„at,ona, n,od,r,ca..ons or. ob.«n.ed „ „p ^^001° on a 'd ^T"" 
extrapolation of the concept of the 'eenom^ nr^i.^.- ; • " 5*'- ^" 

;h,ch .0 .d=„„/> an'd — ^ 

define their patterns of expression. °' ^n^l 

Proteome project, present challenges of a similar masn.tude to thai of o.nn 
projects. Technically, the 2-D gel electrophoresi.. must b^ reproducible and of hTh 
resolution, allowing the separation and detection of the thou.sands of pro cin" in n S 
Lou copy number protein., should be detectable, "nierc .hould hi rn^T? ? 
analy..issy.tem.thatcanqualitat.ve,yandquan.ita!?;e7ya"^^^^^ 
separated proteins, to form reference maps. A nin« of rapid ird rehab?, k "^ 
must be available for the identification and characteh..atrof pro^i^^^ • 
quence of a proteome project, protein databases mu..t be a.scmb'Td tt, / 
reference mforrnation about proteins: ..uch databa.es must be hnk.H 
datal.«.and protein reference maps.Datahasesshould^^^^^ 

Recently, there have been manv change, in the itrhnin,.-. i 

for ana,,.., of pro,con,„. ,t ,; ,h. a™ of 

™„.n=d above, and to rcv.eu- br.n, the pro,rcs. of ,on,e c:;:;7r;,:Lt 
Two-dimensional electrophoresis of proteomes 

Two dimen..ional (20, gel electrophoresis involve., the .eparation of r>rote.n. hv ,k 

lions to the original lechniques. 2-0 elec.ropho«s,' fomiv Lr k V "'""'""^ 
projects .hro„,H separaiin/proteins b. 0.7^Zl:::,:':>to2::Z°TZ 



nuress xx nh prnn nmc p-nu n. 



HEPG2 2D-PAGE MAP 



caimeuitn 

I 



' % • • • • 



» • 



EFIB 



tiSPK 



Apo Al 
TCTP 

ATT. 

synthAM O 

\ • 



PCOH- 



t 




# 



9iutathien ^ MER5 
•J S-translersM 
. eytldytau kin«s« 

traiuihyrelin 

FA8P • ' 



PBP 



thiorMoxtn 



loxtn N 



HBB • 



ubtqulun 




'y9::C.l,s./a/.. 199V Carrels and Franza. 1989: V.nBo^clcn r, ^/ IPQ^, r 

protocol., can resolv. ,u-o ,o ihrec thousand proicn. from 1"! 'i* 

Mnsif 2cl (Figure ]). proicm. Irom u complex sample on a 



:-D GEL RESOLUTION A.\D REPRODUCIBlLI-n- 

A pnman- challenge of separaung complex mixtures of protein, bv •'-D an ... 
phoresis has been to achieve high resolution and reprc^uSi.y Ai^h ' 
ensures that a maximum of protetn species are separat'cd. and h!;^rep:fdu7;;i!r.s 



MaRCR. WiLKINSr////. 

vual .0 aliou comparison of £eJs from day ,o da^ and he,-^.n r.- . . - 
factors can b- diffjculi lo achirv-. . -^no ner . .p nsi^rzh Mif> Th^^^ 

Crrnsr ampholyis> ar: a common msanc of Ko-le-iri- fo 

d:mrnMonof:-D.lccirophorr.rvGeKarru,ua]h 0^^^^^^^^^^ ''^ 
proirm. ,n thr pi ranee 4 ,o 8. and run .n a n"; ^''"'"^""""^ ^^para.r 

Oharrell. 1977, Unfonunaiclv. the u.e of S^ir . , ^"^ 

foru.nrprncedur.,. .u.crpubi; to cathode dnH^^^^^^^ ^ 
h> r -eforu^mg of ampholyie. .loulv chance u-„h i".™., P ?r■16^znx. esubi.she d 

Carr.er ampholyte pH gradient, are also c^l "Tuv f ■''"-m 
.amr:e. . B,eliqv.t . , 9S:. and by hi,h protein ,oa^ .o'^a;^., l^TrS^ 
iimitaiion i v thai ,so electric focusine eels, u-hich ar^ r.<. • , Z ' ^ " '• ^"nhcr 
^.^ .n narrow gJas. tube, need to be r x^^ded bv mt^h ' i '"'"'T ^'"'"^P^^r.- 
.0 the .econd d..ens.on - a procedure .hat^oie"^^^^^^ I ^^^^T 
man> of the above shoncominc. can be avoided bv loadin" .m^r " '^^^ '^"^'■'''^>- 
rad,olabeIled ..amplewGatrek. ,989:Neidhar^^^^^^^^^^ 
1990.. H,2h .enMt,v„y detection is then uch e .cVthmu.h ' ''"f ^"'^^^^-^ 
Pho.phonmag,ng plate. (Bonner and Las^ev 974 1; "''r?^ """^^^=--Phy »r 
1990: Paner.on and Latter. 199.^,. Hou.eve^thiJtn^ ""'^ 
or.an,.m. or n..ue. that can be radiolab/lled " '''^''''^^''^ 

An alismatne lechnioue which iv K.^»^- i. 
U,n,.„.,on scp..a„on of pro.ci^l nto r?!"^. for ,hc fir. 

i.m=. IPG ar, usually poured o„,o = 'iffC n ta\'rf • '""^ 
.rnn. and prov,d=.> =a,y yd handlme CO«„,:ren Eril^Tvo'L B ,1^ " "'''■■''^"i-"> 
miijor advanuj;. of IPG .scpara.ion^ art iha"t Th^wU ,^ ^jf llqv,„. i yss ,. The 
.h=.^ allow focuMn. of baMcan"«n aaj cnrl-, "°' 'r''" " 

..arrn>v pH ran-; arc poss*]. ,0 1)^ r,H ,„ T. "■^P^'^"""- <" or a vcrv 

However. ., . no, cu.en„v p-ol.Wc o " etpc i:;";,""'^ • 
pro,e,n- of ,-oelec,r,c po.nl .-rea.er .han n ? I 

N^rrow PH ra„,e .eparr„on.ar: u t ddre! pl,j I" '^^ 

complex samples, allow, ng ,„■ on^^^^ttr: ^ iT;''''" 

».r,p. are now commercally available, ^h.ch hennTo ddrl; '^'r,'' " ' ""^ 

:ind ,n,:,.iah isoelecric /ocus.nc repn^luc-ibilnv "''•'"'^ r 

There are ,wo means of elecirophoresiv for'ihe sec.md .l,m.» 
pro,e.n. ven.cal ..lab „K and honzonul ul,ra,h,n = 1 Cor ~ rr"'"? C 

~uc.,„, ,e> . no, usually u^Sj^' irbu n re^ir^Xru'" 
Sel seiups iCorc. Posiel and Guniher 1988, Cr,™.! J •'°"'onul 

or no difference ,„ ,he reproducib U y S^Zr: " 
iCorbe., e, „/.. 1 994a,. bu, commercial y avai^^^^^^^^^ ^PP'o-c" 

>-.provide,rea.erreproducibil,.yrorc^car!V;r:V:^^^^^^^^^^^^ 2 



Wtt 

Pf. 

th 




F.purc Ts*... dimensional pel elecimphorcM. allowv ^v^m.f,? ,r ..n -^rc.^ ,„ r,„„, h.-hl.-h. 

. rroiein. cmmon .a «rh pel ( A • W.de pi ranee .u.. U.menM.mal clear.,pn.,rcM> map ..r humun N , „u 
rro,c,ns Firs. d.mens...n .epnra.inn wa. a.-he.ved u«„c an .mmnn.i.cd pH .r.j.cn. of .V> ... o „„ 
Tnr «rnnd d.mens.on ua. SDS-PaGE. Actual pel me wov ,^,,„, . p„„,,„, ; 

-..n silver Moinrng (B. Narrou pi ranpe eier.rr,ph„rc.,. u-.,. „.cd .o v^nm m" .,n , vm.ll r....„n 'f ,hc 
Piasma map Tnr first dimcn.Mnn used a narrou ranee immn^hved pH L-rud.en. of - : ^ -"un.is and 
^e.ond dimension SDS-PaGE. M.rropreparame Uiadmc ua. u.ed". and ,hc ccl hi.nicd ... PVDF 
Proiein« were » isualised « iih am.dn black Actual blot MZt «,a. Ih;-m x Tii.-m 

the u« Of piperazine diacn iyi as a gel crosslinker and the addiiion of ihiosulfaie in the 
caialysi syMcm has been shown lo Eivc beticr resolution and hisher ^ensiiivitv 
detection (Hochstrasser and Merril. 1988: Hoch«rasser. Paichomik and Merrii. 

1 9So ). 
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Notu ithsiandins iht advanc?^ dtacribed ab-vf ,i ■ 
improve ih- rs producibih.v ofl-D eiscirophorcsi/.o f!.Tn ^'^'»nd .o 

companson used ,n database con«J,lon c^^^^^^^^ ""^ 'o «I 

uncrnam.v ,n gel databases. To address ihese iles """"'P^^^''^ 'i^srer'of 

gri separuijon. ha. been undenaken (Nokihara iJ P^"*"' ^^'O'n^'ion of larce T-D 
f'"/.. lyy.M. -^'tnoufh results are preiiminan.- < •>-""m. Jvy.. Hamnsion 

m one stud> u ac found to be threefold improC-ed ovV° POMt.onal reproducibil.u 

J 993 k It should be noted that .m^u '^^ LTr ^^^^^^^^ 
almost completely automated (Bttsv-t er al jSa, ^^^^'^ 
used for database studies. " a'thoueh these are not seneraJly 

MICROPRHPARATIX-E VD GEL ELECTROPHORESIS 
Whh the advent of affordable protein microchar,....- 
..rmmalm..rosequenc.n=.am.noacidanaMf^^^^^^^ 

anulysK. and monosacrhuhde composttionaJ anuK-s^^^^^^^ 

pnorest. ha. been to maintain hteh resoluZ and " f 1"'"^°'-^ 

protein ,n sufficient quantntes for chemtcaTanaW^^^^^^^^ P-v.de 
uuantit.e. of protein, per spotK This becles d^^^^^^^^^^ 

samples .uch as whole bactenal cells, as the mitia^nrn 1 . """^ ^"^P'^^ 

.0 40()0 protem species. Two approaches arH ed 7nr " '^""^'^ ^'"^^^ -"^ 

that can be chemically characterised. The f s^met^^d'^^^ '""""'^ °^ '"^'^nal 
and pool the spots of interest, and .subject .hem ^rn^r "'""'^'^ 

i995:Rasmus..n../.. i 99: ]n ti^X^,^^^^^^^^ i^^^:Wa.sh 
aKo act as . purification step to remove :iccuZT.,^ V^^ "^^^^^^ 
-ch a> slycine. .A more ele.ant appr^h h/s ^'n " ^ ="':°P^°-'- contaminant. * 
or IPG isoelectric focus.n. The Z^^o^^^^';"^' '-'i-? capacity 
was described early ,Ek. Bjellqv..,-and R.^h^tt'TJ^.? h'^^^ -^^^^^'"'^ 
appl.cdto:-Ddectrophore.>,Hanashr;./ ,99, B^^^^^^^^^^ ^^«""> ^>^-n 

of protein can been applied to a s.nele ^el vie drn^ ri^ " ""'^ '-^ 

dred. o, protein spec.es a funher benefit thi ^'"rr^ ''"^"""'^^ 
)ow abundance, which ma> not be Msudi' ed ^ ,ou ' " ^^^'^'"^ ^^rcsen, m 
to be detected. The use of elcctrophore^or c^^ro" " ''°T 
n.ques tHochstras.er . i99la'Ha"n™ ;r9;"^^^^^^ 
of narrou -range IPC separations f BieHqvi^r;,,/ , gofh r'"^ ^'^''^ '"^'^'"^^ 

^.udies on proteins pre.sent ,n low abundanc' '^^•'^'P^"^ '^^^- likely solution to 



-Methods of protein detection 



I™:=~™^^^ .-..The .c,.. ... . 

puT>o.«of,hcsel(forpro.c,;nua,^ra,™lf" I '"""'•"■"I or prcparanvc,. ihr 
.ion,, .nd ,h= «n.MUvi,v rcq^d ~ "^^^^^^^^ ^h™'"' ch,r.c.=risa! 
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Table I: Z<—r>,\n ttai.n for T-D celt rr hlois and Jheir 3n»lu:aiuini. 



Df leriii'n 
Mciniicl 



Main 

aprliiaiion' 



L-n»uii3Wf 



Rcirrffuc* 



radintarwilinc and 
riuiirt'crarn\ or 
pho«phnrimjCinf 

J "Slihinursa silver 



Sitvrr 



r:5o 



Ci>li(ii(i.il ;i>IJ 



ZiriL' imida/cilc 



rnni.r;iL S and 
niuiLli' hi J. I. 



lnJi.1 ink 



Jell iiRM. 
.uliured nrranivntt 



H«iremrlv hirh 
»en»lll\II^ pel 

i vin pel Maininc. 
fan br mnno cir 
piil\i-hriim3iii" 

Siaininc nf eels, 
^laininy of P\'DF 
mtrnrirjnrt remrt 
pniicin «caucn^-inc 



Staininp NIC 
rucmhranes. 
Maininj PVDF 
hefort iJircri 
MALDI-TOF 

Reverse iiainmc 
of Bci* (»r mem- 
hranes: may he 
hrnefifial in 
MALDI-TOF 
i»f pcpiulci 

Suiintne hirher 
priiicin lciaiJ'> on 
r'\'DF. fur prmcin 
^cousnt-inL' nr .imin« 
analv \i\ 

.diamine tif 
mcmnranc-hi>unj 
prnicin.s. siaintnL' 
P\ DF hei'iirc dircti 
MALDI-TOF 

Sijinini; ii> Ocicci 
ylyvuproieinv nr 
Ca ' hinjlinp 
prciiein« 



Sampie* thai 
canniit he bhelled 



Prcparaiive I-D. 
PV'DF tif NC 
mcmhrane* 

Preparaiixe I-D. 
PVDF or NC 
membranes 

Suininc prior to 
(Jireri maw deicr- 
minaiinn irntn 
PVDF; amino atid 
analvMs on PVDF; 
(iciectinn nf smnc 
clvi-oprpicm* 
Cell 



Where pi'stiive 
image k required 



Siaininc prmr m 
(iircci ma«^ 
(IcicrminaiiKn irtim 
PVDF 

Gel <iaininp. nut 
uuaniiuiixc frum 
proicin In prnitin 



General ccl Mainin;: 



^(•rnn""' CanelvanJ FijB/a. 
rajmlarfi in |u«w 

a »pm LaiJum. Garrr'v anj 
Suiter. 

II J nc pr.nein W alta.c anj Salu.'. 

iin «ir hjnJ l"¥;a.h 
iif pel 

J nj pnncin Rahilluud. lou^ 

on spiti nr HitchMra»«cT .i-'j 

hand i»f pel Meml. ivxs 



Jf> nj prmcir 
on hafuJ or 
*p«>i of cc! 



«• htcher 
ihan 

C(>t)ma>Mi- 



Hipher than 



MXlns 
prnicm on 
hand or »piii 
of ccl 



Strunai ri nl . |us)j. 
GkirahJaL-lii r; . 

Coldhcry cuiL. IWKK; 



^'amacufhi and 
Aol^aK-a. 19X8; 
Eckcr^korn ci til.. 

Sirupai rf nl.. I»»«»4 

On\f fi nl.. |uu2: 
J;iincN ri nl.. 



Sanchez r/t(.'.. ]*i^2. 
.^triip.ii ri III . I'lwj _ 
\\'ilkin« ri It!.. IM>»> 



I-'""? Lif7«/.. msv. 

H»j;.-hc>. Mack and 
Hamp;irian. I«SK. 
Sirupai n nl . iviwj 

HKi n;.- pruiirin C jinpScll. 
onhandor M.i.Lcnn:in and 
vpi'i i>l yrl J'TL-rnscn. I**XV 

Giildhtfrp n nl.. (wkk 



P\ p= 3 r"i> wfn.iufnr dillmiritte. sr ■ niitticcliyl»\c. MALDI-TOF ■ 



• iirtni m3»« tpcririimcirN 



fiuinv j..i»icj ij»i-r iicM.fftiM,,, »,n,v.,i,„i 



example, .^oms slycoproiein.*; are noi siamed bv coomassic blue (Goldber" ci al 
1 988 ). and mun>- oreanic dyes are unsuitable for protein dciecuon on PVDF if samples 
are lo be used for d.rect matn.v.avsiied la«r de.orpiion ion.sation n,ass .speciromcirv 
(Sirupai CI uL. J 994). * '"'"ciry 

Although most mean.s of protein detection give some indication of the quantiiie.s of 
protein presenL in general they cannot be u.sed for global quantitation. This is because 



Marc R. Wilkins « «/. 



D. 



.:on.. „o.„c.„c po>„,c and am,„o I d ~Z•""■'^■"•"°'""""■"■ 
;h.r. a. iar« diff^nc. .„ «a,„,„, pa-.r-fX^ i.^^:; „ 
in diffsr;ni mams, mcludins amido black .miH.^^i 

.ono.da, ,o,U. or coo.a„.e b,u. .To^rFoMX^Tr,;"'" o^t 
iTie ir.u.i common msanv of quamitaimp i™ " J^^ ' • Oniz r; ^//.. lyo:, 
.nvolve. .h. rad,oiabcll.ne of^roZ^^^^^^^^^ """"^"^ °^ - :-D ..I 

quant-ta..on huscd on /luorVapby and ima^r anTl'L' f^^^'^''" 
'CrrrJ. J989: Cd.v .nd Olscn. J994, Houe^« L^^^^ 
incthion..if cannot hf dstecied if onlv /"si nJh.nn ^-^"t^-n 

acd anal> s.. of pro.c.n ..pot. -I'^u.U^cd b.oZ ^^^^^^^ ^^^"'"-=- Amin. c..: 

proiein quantitation for the future. ' P^f ^ents a likdy meyns of 

All 

BLOmvc OF PROTEINS TO MEMBRANES 
Eiccirophorftjc blotting of protein, from tx^-rv.^;^ 

membrane. present. ma^vopuonsTorpr^ Po'y-cr> lam.de cd. ,o 

-h.ch are no, po.Mbie uh Jp,otein. rem! " '^^^^^^^ 

blotted to polyvmylidene diHuonde ,PVDF. membmn.T f^^"'^'^-;^'^" Proiems are 
lermmaJ '^equencnE. ammo acid anaJv.i. or mmnnnM ' ^= '^^^^^^icd bv N- 
■o endopro.e,n.e digestion, mono^a^ ^a"^^^^^^^ 

nuurix-a.MMed la.er de.orption .oni.atiorm?.. ^^^'''^"^'> '^*^- direct 

wi,.in.../..i995:jun,b;rt.„/:;^"^r,r^^^ 

VWiztnandler a al.. 1993: Murthv and lubal 199 Fri \ 
possible to combtne of some of the.e procedte o' , sin^^^^^^^ " " 

nK-mhr;.ne r Pucker ca/.. 199- Wjikms rr ,/ u " ^ 'J'"' ^ PVDF ^ 

Th. . useful u-ben nimimaramo:^^^ -''^n^^- -'^^ ' " 

techniques u ,1) be explored ,n detail later m th ! r.^ k ^hcNC EL5 

.here are sotne disadvantages assoc a te vu J" ^^^^^^ No.u-.thstand.n, the above. 
There is ahv.ys loss ofsampledurmnblottin 1.1^^^ c ^'^"''"^ "HM,,brancs. 
'V9... and common protein detect on S 
membranes ,7./,/. /, p,,.en.mc diff, ult°e 'V^h^ T 
prcMc.n. Detailed discussion of ihe mcra of ava 1 K.'" " '''' 
Hiott.nc,echn,que.canbefoundelseu;Tr Eck,^ "^^^ ,7"'"^-"- -"mnn 
i9W4: Patterson. 1994,. ''''' '^'^"'^o^-^ndLou.pc^ch. IW.VStrupai 

:-D gel analysis, documentation, and pro.eome databases .'l^ 

'0 catalogue all .po, 'from th^^D ee m "^^ - 

numner. so as ,o dcfme the number of pro e.":?:;'^ H-^^'^ '^-ntitati ve 

Reference gel .ma,es. con.ruc.:: fL~o' 7^^^^^^^^^^ of expression, 

dimensional gel databases. The.se databases aKo «n, " ""^ 

ajut,ase.s aJ.so contain pro.e.n sp„, identities and 
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^'"f^^'SS ^n,h prnu „mc ,,rn,ec,K 



(N>.dhardt e, a!.. J9K9: Simpson aL 199- ADil^r ^ .on''' ^^^^^'^ 
daiaba^s. coniamins DNA sf Qusncc datn rhTnmrr . " orcwism' 

D .„d pro.™ fjncona, .'fc^'ro^ri™^ 

eenomf and proieome proircK proercs. (VnnR^I.. •^^^/^^^om.n.c e<iabjKhed 



CEl IMAGE ASAIVSIS AND REr£R£NCE GELS 

After :-D tlscirophorssi^ and protein visualiwtion hv c, « 
pho.phonmasm|. imas... of sds are did.i^d fo " ' ""°rocraph> c,r 

-anner. la.cr den.uo.er. or ^har.c/coS^^ ^" 
C=U. ./.. ,090a: Uru.n and j'ackson 1993 AluSfl '"^'^ 
resolution of J 00 -200 mm. and can detect a uJe«n J . ^^^^^^^^ "'^ '"'^ ' 

or more .re, .ca,e. ,. PoMou-.n, .... .eZ-m: e 1^:^^:^ ""^'j '-^^ 
pulat.on. to remove venical and honzontal sire:kin7unH K '""^ 
^poi posuion. and boundaries and to calcubte t " background haze, to detect 
^po. (SSP) number, contamm/vemca a^^^^^^^^^^ 

a...ned to eac. detected spot and bTc^l^e' ^ r';?::^^"^^;?^"- 
list, .ome notable software packages vvh.ch process rDVenmr^^^^^ ^'"'^ - 

Tahl, 2: Some S..f.uarc Packages f„r,hi A nalvM.s ..fCcI Imaces 



=.-ilE - i: 5 
GELLnB I i. n 

MELA ME I A: i; 

OlEST I 4: II jnd PDQL'EST 

TVCHO A; KEPLAR 



Ol^cr 3nJ M.licr. IVK8: U ,nh f/£,y ,yvi u,r,> 

u. L.mun and L-pmn. IVV.V LemL.n. U u ..nj I pinr, Loi 
Mvrn,k rr III.. |yvj» " i-pmn. IV9.^. 

Car„^ — .H......,.,«v:c....„.. 

I'*"-. RiLh3rd..,ft. H.rrr .,nd Andcr.„n 



. nf rri-rrn.c^ arc 



nn: r,naus,nf .n.iudf .nmr reference n, u.r „ 



Cll ;iv 



lUiMiiri III liif 



A. there are difficulties m the elecirophore.sis of .ampl-. u-„h lor.- 
reierence sel imaces are often con.Lr.. t r ' r^producihil- 

^GarreK and F.nza. ,989: Nefdtrd " 7 989 Im i"^ ^^'"^ ^'"^'^ 

:000 to 4000 protems from one eel to another '^Dre sen, ^ °' 

-up. anah s. systems. Matchme of gel ,v "^^^ 

n-.anually destsnates approxtmat-K 50 or o n" ' ^^^o 
•o be cross-match-d Protem^u h .^k I P'^'"'""' ^P^"^ as -landtTiarks- on o.,, 
--puter-bted -und landma^kL 
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JcnM.omc..r. ,B , Gel .mace after proces w «mL Trll^^ /w"' ""^'i-*'^ »^ "HiurcJ bv l.scr 
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CALCVLATION- OF PROTEIN IS(- HCTRh " POINT AND M0L€CLLAR U-E,chT 

Eviimation of \hi i;o-icarir poini ipl: and mol-rular ueichi r\I\V,nf „rn.* - 
:.D p:ov.d.. f.ndam.nul paran.... for .ach pro>;.n. '^^c^^^:::^^ 
during .drniincanon proccdurf. f jllowm.c .ea.oni. The pi and MW of nroLm 
ar. recorded .n Z-D «l dauba... Accurate c.:ma:ion. of pro' .n pi a d M V canT; 
ohtamed by u^ns 20 or more known pr j.ein. on a reference mop ,o con^ruci ^uZ^^ 

MU of unknoun proteins (Nc.dhard. r; «/.. I9S9: GarreN and Franza 19S0 V 
Bo.elen. HuMon and Ne.dhardL. 19%: Anderson and Andcrvon 1 Anf ' 
u, I 9,u Latham ./.. , 992 Ai.rrnnvely. .he MW of Z 
.0 P\ DF can be determmed vrr> accurately b> d.rec. mas. spectromc^ e"l , 
c't al.. 1992,. Where immobilised pH cradieni. are used Th. fn^ 
protem. a,... .he:r pi . be measL Vnhm 0^^.^:^J:^^ 
ammo ac.d sequence ( B jeHqM., a oL J 993c .. 1, mu« be noted, hcnvcv-r hai n n 
carnm, poM-tran.ianona) modiHcation. may ^^ra.e .o uncxn'^d Tuu- 
positions dunns electrophoresis (Packer 1995). ^ ' ^ 



Is 



.Sr»OT OL ANTIT-^TJON ASD EXPRESSION ANALYSIS 

A maior challenge faced in proteome proiecis i> ihe uuantiiative :,n-.K . r 
separated h> Z-D electrophoresis. The mo.t accurate mcrro or^^^^^^^^ of proteins 
.0 determine cnem.cally the amount of each protr;:::;::^^:^^ 
posn.onal analysis. However, .he current method of choice for nuan^i ta ,ve at k 1 
of many proteins ,s ,o rad.oiabel samples w.th [S] methionine or 'X ammo aad 
perform the Z-D electrophoresis, and measure protein levels in disiL "r L ' " 
minute ,dpm, or units of optical density. Quantitation .s achieve ^rL:: 
^.mt lat.on countm,. or h> ,el ima,e analysis .here spot densities arc quant t'i d 
h> rei.rcncr ,o eel calibration s.rips ccniainm. knou n a.ncunts of r.Ll .h . . 
pro.cn or ..a.ns, ,he .n.ecrated optical densny of all spois v.ual.scd . V . ilk U 

- al.. 1990: Ceiis ./.. ,990b: Cel.s and 01s=r^. ,994: 

Ga.eK and Solter. 199V Fe> ...... 199... All approaches cffca:! k' ri;; 

L m Zn 1 — C loaded o„u. .h- : 

Limitations thai remain w„h rad.olabell.nc methods are that ahs<,h„c cii mman.u, 
no, achieved because all proteins have va^^•lnc amounts of anv am.m, a U 
on^ easily labeUed samples can be investigated. Quantitative 1 
. aherr^ativc .G.ome.ti n a!.. 1991 : Hamn.cion c, uL. 1992. Rodriguez v^ri w^^^ 

- a of'!?- ^I'It ^^"^ i"'sj.hiourca . w^n::; d sa • 

j.hj 1. of cxircmsly high scDMUviiv. ^.'•fr.. 
Whtn projt.n .pot< from s^mpl-. prtparcd undrr d.ffcrcni condiuoh. arc qu jnu, „.d 
.:nd n,a..-h=d fron, gel ,o p=l. „ hecomc. po»,bl. ,o .xar.,„c dun«c- a^d na Z' n 
rr=.c.„ c,prr.>,on. Large .cale .nv.M.gauon of up. and do^vn.rccJlauon o' ' o" „' 
meir appearance and disappearance, can be undenaken For e.amni, 1 
.un.<forn,ed human ker.,i„oc.v,e. „ ere shown .o havl ,7° p':, '^utle TnT^Z „ 
rejulaied pro.ems compared .0 normal kera.inocv.eMCelisand Ol 1 W4, L, , 
Svihcsis profiles of 1200 pro.ems have been eMabl.shed in Mo^ce'l mou'e cr^b,^ „ 
.La,ham„„,.. ,991. .9«,: and a pro.ein.s ou, of,,,, were found 0 be ma r-f" 
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cudm,umlo^,c^yInu^nanprolCln.f^h-^c«Cf;^/.. J993, Cnrrrl.v , u 

in prottin rxpr- c^ion a; y rssuli of gens disnjpuon^ have nicn k.- " ^^''"-"^ 

an. P N,o-, -U... P„.o„„ co^^uni^u^nuLt^vr^"" ^^^^^^^^^^ 

prccm jxrrtSMon under diftem condiuon. can b< Mt'llZT"'"' 

-a, ,i,cal n tihod- iha, find sroup. of relaied ob.sc« u.Thin ? 

ca.h j-l , onum. pu:,nu,a,,vc dau for f«M„f „ .^^^^^ -"""P* " 

pmc„„al for ,mcs„.a,lon of cellular conZZ^t^'TI!^ !'''''^^^^^^ 

.mn«„«. „ dear .nv«u,«io„ "r«" c^^^ 

currenrlv ,=chn,cally unpo.sibl. usmj nu%l«c.acid b°^d ° " 
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Hum. in kiTjirnii. \ ir daijrsai* 
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' AfL-.-nnc Pffiicin 
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Rj: lr \ rr j;u.ihasc 
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ISS rDPACE tnniam.n- 
ituir.ji, rricrcn.c mnp^ 



^ c.i^i Pniifin Datnna^f i VPDi 
•inU \cjyi ElcL-irophnrciif 
Pfiiicin Diunhase i^ EPDi 



Cci vpoi. unuo u.tih GcnBanL 
.md Kimara Linnes; quamitaiivc 
sptit measurement under difi'cr- 
cm jrrdwih k-nndtlions 

Idcniifitaiion ni disease markers; 
Hv o separate daiahoACN ha\ c 
hecn ewiahlivljed 

Exicnsivc idcniiriLjiinnv 
uuaniiiaiivf Npnt mcaturcmcnn 
of iran-ifirmed lcIK: ideniifita- 
Kim lit disease mariners 

Ouaniiiaiivr vpm 
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' II' - tell staec 

Diicumcnti t hancev due i:. 

cxp«'surc I., inni/ms: radi;m,.n 

and iMxu vlicini.al> 

Drt.itlcd NUh.fliular 

ira.'iiiinaiion »iuuir* 
Etirntuf tiudifv tin rcL'uJaiinn 
"I prMiein. h> druc^ and i..x,t 
aprnis 

Atcc*«ihl- via Wt.rld U iJc f, 
uuantiiainc vp,ii measurement 
under difiereni Limdiimns 
Aft-evMhlf V,;, u„rld W.dr VI ch 
n>mplcte»\ iniccraied u nh 
SWISS-PROT and 
SW 1SS-3DIMAGE 
Ciimplcii»l> criKsrclerented 
iTsantm dauhave. VPD ha., 
cttcntive iniiirmaiiitn on over 
.'.VK» proieinx; VEPD has 
fnanv idcnuricaiinns 



VanB.>s:elcr. and Neuliiardi iwv, 
\ anBi->?clcn ri al.. iwv; 



Baker ft nl.. iw^ 
Cnrrvti ct til., ivwjs 
Jun-rhiui rml.. ivwj 

Cell* CI III., iwwoj 
Cell* i7 al.. iwyi 
Ccli» and OKen I»*vi4 

Laih.im r; 
Latti.ini <7 |uo2 

Ci..n-..-t.i. T.i>l.,r and T..||.,Ut-n. iv^r 



V^.n:,r;„/ . )uvi \V,r,(,,.,„/ j^u; 

And.TvMn and AndiTvnn. luw]. 
Anij.Tviin r; <//,_ |vy< 
RuhurJv.i, Hxrn and Andt-rsiH. (ugj 
Garrc-l. ;ind Pran/a iv.XV 
Boui.-ll n til., jywj 

Appcl r; „/ ivy^ 

Hi>»n»irj»viT r/ m/.. |uv"< 
Hurhvx n ,il . ivv.i 
Ci'lj/ rf III., imy^ 

CarrcK r/ n/.. ivvj 



p. ,. . . 

of rtfij 
>houlJ 
Macini 

annour 

One 
SWISS 
1904; 
feature 
:DPAt 



Table 4 

All thrc: 



InlttniL' 



Anntita 



Refcrr 
Daiah. 



Other 



FEATl-RES OF PROTEOME DATABASES 



Protromr pro,=cts rdy h^vjiy on computer daiaba«x ic Mor- infom,.,.o • 
pro... expressed by an orp.... -Prco.. dauba...":;o:,d^^^^^^^ 
mformai.on of prot.,n> already choractcn.ed clscvvhrrc. a. u dl a. proterd:u:^m 
:-D 2d. .uch as apparcn. pi and NfW. expression level und-r difiVrem c„n V 
.ubcelluiar lo=al.«uon. an^ informauon on poM.tran.lai,o;ul mo^^^^^^^^^^ 
of reference :-D eels. sScums proie.n SSP nuXr. .nM ''"••'=c> 

.n„o.,=. ,0 ....... or ~'^^;ir^:::r^z:r:::r::: 

sequence banks. "utitu a^iu 

One example of an intesraied proieonje daiaba« th- sun? nf cvvicc 
SWISS.:DPaGE and SWISS-JDIMAGE d:uabas=<Unn;i , if J' 

199.: Appe,. Ba,roch and Hoch«r.»=,. ,9,4: B.,^ h a'd Bo ckn!^^„ '^^ 

:DPAGE and SW,SS..,DIMaGE arc ucces,ibic ,h,ou,h .he Wnrti WidT 

Table J: Tn- SU ISS-PROT. S\VISS-:DPaCE and SU'ISS idtm Arc 



Infnnnainin 



Aiiniiiaiinnv 



Cr(>i«- 

Rf fcrrncfd 

D.i:.in3iCi 



Oiftcr Feaiurf! 



SWISS-PROT 

Tl'xi entries of sequence data: 
Citatinn infnrmaiinn: 
taxomimif data. ^0? 
mines in Rclcajc 



Prni-in run;.'iM>n . 
Post ir.injl.itmnal 
mmJifu'Siiiins. 
Diimain>. 

Setf>ncjr\ sirui-iure. 
Ouaiernarx siru.ture. 
DiNCusfv. avsti.ijied 
n iih prtiirin. 
Sequence innriiris 

SWISS-ZDPACE 

S\^-1SS-?DIMACE 

EMBL. PIR. PDB. 

OMIM. PROSITE. 
Medline. Fl>•^la^e; 

CCRDh. MaiieDB. 
^ onnPcp. DunDB 
Na\ leaiinn tr> oiher 
SN^'ISS daiahasfs a;.hirvcd 
h>> selenmc rntnes wnh 
c omputer muu.se 



SWISS- :dpace 

2-D eel imacev of- human 
hvcr. pia.^ma. HepC2. HcpC: 
seiTcied prnieins. red hlixid tell, 
iymphnma. fcrchro>;pmul lluid. 
mai-rnphacf liLc tell line. 
cr>thriilcuk.emij lcII. pl;iii-lci 

Gel iniaspv t^herc 

prmein i» lound. 

Hi.u prntcin idcniiricj. 

Prtittfin pi and MW. 

prinein number. 

niirmal and paiholnvu-ji 

V jriani'i. 



5VVISS-PR0T and all 
"ihcr daiah3<>cs 
a».i:r»»ihle thrnuth 
SWISS- PROT " 



Gel imaces shuu poMimr 
t>f identified proteins, or 
rcifion of eel «hcrc pnnein 
should appear 



SWlsS-.-»DlM..\GE 

Colic. linn of }M) '-D 
imasc^ III priiicin> 



Ml .'^nniii:ii,,.n is 
.is;iil.ihU- 111 SW|S^. 
PROT 



?^WISS-PROT and .HI 
other datah;i..cv 
;iLi.i'»^ihK' tliriiuL*li 
SWISS-PROT 



Miinn and stereo 
imaccs u\jilahlc. 
Imaees can he 
transicrrcd lo Intal 
ttimpuier imjce 
vieuinj prtirrams 



Marc R. VV iLKINS « al. 

( Bernert-Lee ei al.. J 992 ). allouine anv comn 

cur bt stlccitd wuh a compuier mnu«. From th- - d'.MK'^"''' °" '^'^P'^-^ -"'i 
ab( ui u proicm. .nciudms ammo acid .cpu-ncr Inrf L n7 '^"""'''^ 'n'ormai.on 

.m; can b. v..u ed if knou^. ;,nd the ''d f^^re « T"''^ " °" ^ ^^^"'^^"^"^ «' 
-aiiublc. R=f.r.nc.. .o nucle.c add and o^^^^J^^l'^' "'^.'^^"'^ ir 

Orcani^m" databases, contamms detailed Drnt^;„ . 

^..u. a spece.. arc becom.nc common a7:'„^om/and n -formafon 

Thc.e differ from nucJe.c ac.d or proiein seouenrTH u ^ ^'^'^"^^ r'-^=re>.v 
PROT because ihex are ,ma« ba Ted and "In^^ ^^^^^^^^ 

map posn.on.. transcnpt.on'of senes and nrotTir ^^^°'"o.omaJ 

.hcrniua cnl, ,ene-protem database I VanBo^. . u^'''"°" "^^ 

VanBo,eien and Ne.dbardi. 199^ [',^,7^::"- ^^'dhardi. 1990: 

EC02DBASE. one example, h con.aTnfe^ ' Un ^ 

.nfoniiauon „ncludmc pJ and MW es.imaiera^d l^t T'^r ''^ '^^^ ^P"' 

mation (GenBank or EMBL code. chromo.oniTn ^ ?enciu- mL 
.Kohara. Ak.yama. and hono. 19877 tran c'^ '!^^^ 

rcgula.on mformauon flevel of proie.iTxn " ' °^ ""^ P^"''"-'" 

referenced to ihc SWISS-PROT database (Bairn.h ^^^'^^-^^^ aUo cross, 
anticipated thai organism database. u-ilUoon h.rT Boc^i^mann. 1944). l, i, 

-ailable informalion about a p^^^^^V^:^'^^^^ 

consistent manner tn uh.ch oreantsm da.abL. are ass.mM TV' ""'''""^ "° 
comparisons in the future. JSNcmblcd. which max hamper 

Identification and characterisation of proteins from 2-0 

Thrnumber of proteins idemiried on a :-D reference nv.r, I 

a research and reference tool. .A. most reference nv^^ K ^ '^'^t^mines „s usefulness as 
proteins Identified, a ma.or aim of cur en, p^cTeo^^^^^^^^^ ^' P-P-'u>n of 

Torn :-D mapv m order t<, define th-n a' LnoZ ^ ^"'^^^ '"^'"> P^"'^'*"^ 

datab..es. or a. unknou-n' Protc'; dVnt r," L' a"?"' ^ 

open reading frames, and provides focus fo DN a ^-^"^'^'^"'"n DNa 

Characterisation effons bx pointin/ro prorc^^n^'^hl"'""""-' ^"'""^ ^"^ protein 
30(KU40C)Oprnte,nsfromLl«le" Dm^Th ^'"^"^ '"^'^ be 

protein screening .s to -den.;"^;^?^ ^^^^^ 'dentiHca.ion. the challenge i„ 

Traditionally, protein, from Z-D .^eK h ' ' k'- , " T'"^"'^ °^ -'^ cfA,n. 

'".-"oblott.n,. NMerminal-nJi^c; 't:^ ^> '-^-^-^ -^"h 

^■om.graiaon of unknown protein, w.th knoun ^'f'"'^'' -cqucncinc. 

'H>molo2ousccnesof,n.ere.hrtheorc^'l °^ ^> <>^'^^^^Prcs.,on of 

.^9:: vanBo,cie;::,:;";';9 s-^^;::;:^;-^^ i^s7:Rosc„fe;:; 

-~approach.oma.pro=:V^-^-:^^^^^^^ 
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Table-: Hifrarrmral analvMi for mas« jcrrenmc of ' n 

Rapid and .n-,-rr.Mxe le.-^n.uuec ar- „^d a^ a f.m ... nu-m^ 



Orj;r Jurniifiraiinn ir;nniow 
Aminii aito ana isn 



5 
9 



PrpiiJe-ni3v^ in errprin.mc 



Cmhinaimn <.i amim. at-.d ar.alxsi^ anj prp„d- 
rmc-rpriniing 

M3« spr.-irom-tM ^coutriLC las 

Exiens.x-- N-tcrmtnal Edman m.criKeouencin^ 

In.ernai prpnci- Edman mirro^qucncinf 

«-nrj> ic^nisaMfin. p,«|.S(.urtc Je=a> MALDI-TOF» 
LadJcr teuurn. inB 



y^illiny citiL. vuhftiiiiciJ 

Mann. Hnirur mj R..w>> . 

Suliiin etui.. |vy5 

Cordw erir/«/ . IV05 
^ asinger ci al.. )«v,<. 

Mann anJ Wilm. lygj 

MaiMidatra. IVK7 

Roscnfeld rr «/.. jvv;; 
Hcllman r/ n/.. ivy5 

iohnson anJ W jl.h. JVy; 



ull.rnunv, ,o .r;jd,„on^l approaches ( TaM^S: Wa.i„,„«<,/ , pc , -r. 

ust of rapid and cheap .djm.r.cai.on ,ooh such av amL ! ,^'^'"™'"'-'''' 

slower, more „p=„,,v. and ,imcconsum,n.idt,Srn r"'" 

.he cohMniCon of ,hi. h.erarchv ,he an.Ksule co"^^ , if "cocsar. . In 

of ,he da,a crea.ed has been considered a" whi r S^erch'' 

macn.ne „me per sample, ihe analvsK of data can '«hn,,uc> require li„le 

consummj. Amino acdanaivsis and peplid- mass nl^^ """" 

icchn.ques ,n ,he h.erarchs are discus«d „"d-,a K^f '^r'""' "J-'-n—on 

.den.,r,ca„on ,echn„aes m T..,. ,. see Pa,re;ro::r,°; , a^nd 

"ROTEIN ,DESTlr,CAT,ON BV AMINO ACID COMPOSITION 

.Uc;uf,caMon"?pro[e,nslLm^'.D"cels ifteTlarlv u"^ >T"= ^"'"^"^"'"n for 
This lechnique uses a proiem's idfofvi^I™ ' n 'nn^ ' "' 

.dennfs „ hy comparison „ „h * o^" a, coZo 1 "T"""" "'""'^ 
The an,mo acd compos„,on of pro,e?„?ca„ dT-™ u ''^'^•'-^<^- 
-d,olabel,.n. and ,uan,„a,ive amo Xfr!p^^^ af ™'d • 
"/.. 1 99J: Fres no/.. )994| orbvacidhJd,;!? /" 

.-hroma,oeraph,c analv,,s of he rtsul, t ammo = T"*"""""'"^ ^"-'^ 

1995^ AS differential mcabol.c 1^6.^/; !oerimen, ' ' ^^'■"'in^""'.. 

phor.imajepla,ee,po..resofup fuOda " 
-olabened .a.np,e.s. .he .echn,^ . 
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i:ruph>-hused jnul\ SIS. Proteins bloti-d in PVDFmeinKr m* u . 

.11 ir.-^ C.jminoacid^rxtrjciffd ma .innje brief sien Mnrf P-irK -"'"'.^^cuin i h 

h n protcns have been prepared by micropreparaiive :-D cLxror^hc^r^Z , u u 
Afier the am.no ac.d conipos.uon of a proicin has been dciemiincd con,„... 

m.. M.,chi„, „ o^fni'- jrj'^-.r;:'r,:;;;;'j 
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''"'•sain*.: Seq. 

K X. X k S 

if S H I X 
« S H $ K 
K » I IC Y 
K » 0 T Y 
K A 2 E 0 
n K H S L 
H X. H R A 
M S S X L 
K S S K : 

Fipurt 5. A P\ DF pmiein «p<.i trom an £ ,,.h 2.0 rficrt-n r mm . 

..mc omplc .n.n .un.cn .m,n., 3„a.> s.s The S.^,;::,u;^::;::::::^r\": to?*'", 

a.KUnmr,..m«nnl Hir vpoi. .-(v well a. csiimaicd pi and M\V uprrm-„.K., '''^ ^Mien Hoe amini. 

.he ...rre.-, prn.c.n .dcn.,fu-.-,i,.,„ However. ,hc .cuucncc" L M L K R ? '"".^ ^""'"J^'"^--- ""^ K-.nj: 
proiem serine hvdr<.vv meth> liranvlerj^ ^ m t.mi.rmcj Ihc ij fniiiv III the 

tr. piophjn urc dr.troyed dunn? hydroly.i.v. a.paracmc and dii.ammc -.re dca.md-., , 
UMh.r corr..pond.n.c ac.ds. and proline is not quJnuta.cd M,nra ; "^^^^ 
Th. computer programs produce a of hcs, maichinc proteins u-h,ch arc nnkl j K 

m.,ch,ns An example of prcc.n ,den„nc.,ion bv ammo .c-,J a„„p,-M,K,n .hr,«^ 
in r,v."T J, To djle. amino acid compo.iiion hav hecn uvcd lo idcn ,i\ , 
reference map. of ,,,„,,,,,„„,„ „w„rL„. «,,.,.„/„r, Z' 
'"'"Uf., <«-n7„n<-. D,cn„.wl,„m d,.u„i.lr,„„. human vera luin^n h^ n 
l> mphocyie. and mouse brain iCordwel, „„/.. ,995 C„"c T ^\^r 

When sample, from S-D .«,.< are „oi unambiguous,,- idcmined bv 



amino acid 



'6 M-^RCR. \V|LKl\5f/a/. 



c .mpoMtion. pi un.I MW. ofi-n ihf corrs n ,d^n„T. 

I .;P r^nkm.. of ,h. Ik, . Hohohm. llouik l ^^^'T '^^"'^'^ — n^. 
^rectromc:n -.cqurnc. couc.p, -Nlann „ n I'So - " '''' 

which .hr .umc sample . uscd for am.no ac.dtuK-' "1^' ' T'*"'"" '^"^'^^ 

I man d..rada.,on cycl.. ca. u.ed NrodincSon f " ' ^' ^ "'•i^ 

Pro..,n. per ueek on one auromarcd. mul "ciriJ^^^^^^ '^'f' or c 

MHon. p, .nd MU or pro.e.n. are match d ^^''^ -"P' ' 

Vtrrn^mal sequence, ofhe., matchmc pro.^mTll^^^^^^^ and 
•o confirm the pro.c.n ,den,„y iF.,,;/,, ^^Z''^''^"^ ^^"'^ '^^ sequence la- 
pro.e.n. are N-term,nall, blocked, hut a. on v u TJT '^^^ "hen 

-^cent.hie to ,he a.eivi. formyj. or pvrocJuiamVl mo rr """"'^ ^'^-'J^ 

.Keif provide u.eft.1 ,nforma.,on fo TeauTn^^ '^^^ ^"^"^^ 

N-irrmmal sequence lac and amino acid cor^ ' 'J^uncai.on. .A Mrcn-.h 

-nested are qtnc.i>-and . 'dcnt.Hcat.on . ihll; 



T^chniqujv for iht idtntificulion of Drai^mv i, 
r.c.„„, h=cn d..cr,h., ,H=„„, <-° XTi . XnZ' H "■''='^™„„= have 

"^'-h.,,. A. pro,..-,n- have diffc7em "'V -"""-""^ P"--n 

...n;,^M;r;-rp^=^:i:^^^^^ 

J..«M- .rc r.por,cd ,n produce more cnz" — ' ^'""""S--" ^'d 

j-Ko been u.ed ,P.pp,„. H,,,rup and BIc.r t^'""-"- '■'^r'^'^^-'l-o 

Pcp..de^oh,umrd.nj.dt.,r.Weforpro,«„^™L K """ibfr „r 

bonds of , he pro,e,„ broker, and produce'; Ir™ •*" '^'^"'M, 

."..enuWe ,o d,je-„on. Surpr^.n.!.. chTnt" ^e 'h^' -ure 

hron„de ,me,h,on,ne <pec,f,c,. -formic add f """""^ ^> 

n"rophcn>K„lfen>,,..-.-me,hv|.v.bromo,ndol „ ■'"•^ 

lv«n explored ..mean, of pepudcpr~ ^r"-T.c, have n™ 



fNikodfm .nd Fresco. 1979: Cnmmm. r/ «/ joon-v o ' 

AAcrpr ..n. arr d.r.«cd.pepudr m^„^:^,d^^^^^ ^^92, 

D,r.r: .n.l> of p.p. d. m,.x.ur.. can he ach.- ed h.^L " ^^-^^'^-'Ctn . 
^nrcrometn . pj^.^a d-.nrp.ion nu.« .periroml^ ' ^'^"^^ ionisuuon nuS. 

h,chrr .cnM„v„y anJ.crca.er lolerance .ocomJ!; ^ ^ ' ^''^•^•"^'^ OJ 

mor.. re.en, mod,n.3..on. -o .san,p,rprena^^^^^^^ runAc 

\ orm .nd Munn. JOQ.. Vor.T.. Roep.torff w m ^' '^^'^*' K^VJ 
.pe.ro.e,n .i,o.. . fracl^or 'd' ^^^^^ -"sa.v., of 

A major challenee a.^ociaied u ah peptide mas. Hn™ • • 
pnor to computer match.ng a,a,n.. TbLe.^ ht^c,^^^^^ 

mu<i he examined curefullv ,o deicrmmf uh.rl . Spectrj 
'".ere.. . there .re oU^.^n.:^'^^^^^ -P— i peptide n^us^Tr: 
^^ance. present .Henzel rr ./.. ,993: Monfr T'"/';'"' -ntammatin, suh- 
Funhern^ore. tf pro.etn alkylat.on and rcduc^ofha^. k '9'^-^.. 
Protetn d.,.eM,on. peptide sequence covera." mav t\ Inr 
-a-es p.. ,n, repre.enttn, dtsulftde honded p^p^de, '^J.lT '^''^ ^^ "^ 

by tne pre.en.-e of po.-tran.iat.onai modincl" o^^^^^^^ 

unrnoduiedpept.de alone can he vcr^ difncuh^^^^^^ "^-^ of ,he 

canon, .ntroduced hy elecrrophoresi... aracr^'am^^^^^ 

oxid.>,on of mcth.ontne.urearso known Wncp^^^^^ ^"'^ «he 

i^-...„„^;„ n„.j.r,.,a,„„, o.mplua.rd., " aKs 7 P^rndcs ..■.,r^,n.. p...,- 



"v. CIV l.iiinn 

• '-'.■.iniiU.iiii.n ..I Axn ..r Gin 
•"^'-uliidv hnnd inrm.iii.m 

"•'rm\ I.Tiinn 
H--u.>r. ,Gi,. Gj; 

■-i>drn\\l3,|,,„ 

^_:^T..xlh^^..^.m,nf. ,CLN v. c.fNAo 
°n■l^,p|„1^^i3l,„n 

•■'^"J'-nned (r..,n Cin 
■'^'.-«ia-.V.d(N..u\Ac. 
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databases .rrv.-ucd in Coiircll. 1994). Mairhin« is usuaJIv u^S;^^^^ 
t.ve manner. vvhrreK peak, of ma.ss iOO-XOOODa are .eV-i-d^" ^^i"'"*^'- 
■ vanou. search parameters .ncludm. MW ofprote.n. rJ^^:^^^ 
number of m.s.ed enzyme cleavage., alioued .HenzH ct ctl i9o- <, °' f ^ f ;f "'^ 
Ra.mu..cn.„/.. ,994KThecorr;aprotem.den ■ v.^^^^^^^^ 
pept.de masses m common wiih ihe unknown sampfc Idema" l^v.t ' 
u nh as feu as thre. prpt.des. but unambieuo^d^ni ^^ 

mass .pcc.rom.mc map covcnng mo.s; P^pt.Se of ^ "^VS ' 
Va:.s . ^993,. To date, pcpt.de ma's. T.J^rZTo^'^^^^^^^^ 
undertaken from the numan myocardial prote.n and ^Z.uno.\., maps "o „ an r ; 
-D gd. and from reference map.s of W/„,„.^ n,r//»;n„/^ ° ";^"f ' '''' 
rSuiionr/fl/.. 1995; Ra.smu.s.sen r; a/.. 1994- Heniel r;«/ too- " 
aL ,995. Wa.s.nger . ./.. 1995.. ai^ou,h .he fcl^e 1 mo j ot;:r^T:;:'' 
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MASS SPECTROMETRY SEQUE.VCE TAGGING 

hrn, ,c fruyn,=„,«,on by coH.sion ui,h a ja^. and n„uHv dLrm.nc he ,, „ 
fmgmrnis. The rr.ulun} spectra c,ve. mforniaion abom' . pep„d,' 1, i 

«,uc„c=.Th=ta.n,e„,.Uohmas«.o/p=p„d«cahr.rcK bcu%7o ?0 „. I 
^equtnct. hue n usually allows a shon 'sequjnct u.' of - or I :,„«n^ T '^ 1 

This ,a, and .h. cr,,.L pepu.: ^r.:^':^^:: 

.Fa,n., ada.aha«.prov,dmf aIikcly,d=m„yof,hcpcp.id=ahd >hc pro,c n ^« 
Th; major drawback lor th,^ it.-hniquc as a mass scrcsnin- tool ,.L rZ,Z 

m.. da,a .cncra.cd and ,h= h,,h ,cve, =f .xpcnrsT "-^rj^for ? ^ ^ll^^ I ^ 
increases ,hc power ofpepr.dc mas. r,njcrrr,m,ns pro.cn id.n,ir,ca„„„. ' 
Cniss-species prolein idcntirication 

Proitm sequtncc daiahasts coniinu; lo crow al a ranid ni- v„ „ 

apprecaicd ihai close ,o 90^. ofall ,„fo™:„„h cohiamc' „ '„ , J,:™,"^^^'-^ 

comes from onlv lOspeces ,A. Bairoch. Pers. Comm , Fonun H u "f 

m^wi r/ «/.. J yyj j. j fil\ jpprOJCh a OU ^ prOlemv frnm rrf-r««^- 
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Marc R. \\ ilkin'S et al. 



Rapid cros.^-spccics idrniifjcaiion of pmieins ^rom 2-0 r-fcrence ma©- 
undsnaksn uith amir.o acid composition or p.p,idc mav. fincrrpnnimr m«ho ■! 
^Fl?ure6L bui ihtsr -chniqu-s alonr ma> noi idfmify proie.n. u'nambieuou.Jv uhrn 

phylossnfticcros-.pc.ie^di^iancss are real or ar.alvsisd3t3j*of poor iuaJuv^Vn^ 

n/.. 199V Shau. 1993: Cordwcll n/.. 1995i. However, ver^ hi£h confid-nc- m 
protein ideniine^ can be achieved when 1»ms of beM-maichinc protein, ceneraied'hv 
both techniques arc compared (CordweH et al.. 1995; Wisineer et al' 1995) The 
correct ideniiilcation i> found when the same protein iv rankedhichlv ,n hst. of b-<i 
matches generated by both techniques. This method has allowed Lpproximaielv I'-'O 
proteins from the reference map of the molHcuic Jpimplasma mvllnemm r-presei^i 
mg approximately one quarter of the proieomc. to be confidentlv idemif.ed K 
reference to protein information from other species ,S. Cordu ell. Personal Communi' 
cation). When cross-species protein identification is to be undenaken it should be 
noted that the molecular w-isht of a protein type across species is usuallv hi-hlv 
conserved, but that protein pi can var> by more than 2 units (Cordwdi ci a! 1 99^] 
Accurate molecular weight determination by direct mass -spectromem of nroiems 
blotted to PVDF (Eckerskorri er al.. 1992) should therefore he u useful additional 
parameter for cross-species protein identification. 

CHAR.ACTERIS Alios OF POST-TRaNSL.ATIO.SAL MODIFICATIONS 

Man\ proteins are modified after translation. Such posi-translational modifications 
including glycosylauon. phospho^^•lal]on. and .sulfation (see Table 6) are usuullv 
necessary for prptein function or stability. Some abnormal modification-, are as^oci 
ated with disease (Duthel and Revol. 1993: Ghcsh a al.. 199}. Yamashita cr al 
199."*). In proteome studies, post-translaiional modifications can be examined on all 
proteins present, or on individual .spots. Studies on all proteins provide an indication 
of which proteins may cam a cenain type of modification. For e.\ample. ^-D -cl 
anulvM^ of cell cultures grou-n in the presence of ['H] manno>.e or ["P] phosphate 
givf . an indication of u h.ch protein- cam glycan- containing mannose and which 
proteins are phosphor> ijted (GarreK and Franza. 1989). Lectin bindm..: .tudie- of 
gels blotted to PVDF or nitrocellulose provide information on the sucJharides. if :mv 
thai are carried by proteins present (Gravel r; a!.. 1994). * ' 

When individual proteins of interest carrying post-translai.onjl modifications have 
been found, micropreparativ- 2-0 electrophoresis can be us-d to purifv them in 
microgram quantities (Hanash a al.. 1991; Bjellqvisi n at.. ]99^b) 'if protein 
isoforms of .similar MW and pi are to be studied, focusmc uith narrou ran-e pi 
gradients (I pH uniti can provide greater separation and resolution. After electro- 
pnores.s. the type and degree of protein phosphorylation can he invesiicated tMunhv 
and Iqbal. 1991: Gold fral.. 1994). mono.sacchar.de composmon can be determined 
.Weitzhandler a al.. 199.V Packer ct al.. 1995). and the structure and exact site of 
glycoammo acids can be invc.siieaied b> either Edman dccradation based techniques 
or by mass specirometn (Pisano ci al.. 1993; Hubenv er al.. 1993: Carr Huddlesion 
and Bean. 1993) W„h further development of rapid techniques. invest,raiion of 
phosphor> laiion and monosaccharides by chromatographic or mass speciromeiric 
means is likely to become a routine .step in the characierisation of posi-iranslaiional 
modifications of proteins from reference maps. 
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''"'^ Prnienine projects 



The stati-s orproieome projects 

;.lan> isrhnical aspscis of proicom: re^arch have ajreadv k ^ 

Advance ,„ pro.com. pro.cc. ui,, .„i^r„t 'Z;^'^^ "n^^ 

. ,0 enable an id=n,i,y. ammo acii ..ciuMTor ..on ? 1! """"""f 

each pro^m ,po.. T.i,le 7 .how, jcnomc ^^'.'om/s^ '™ d '° 

pro.e,n. already defmed for a number of model on^^^:"f 

£enor,e ,e,„e„cm. program, for £. „/, and S Zr"!^r J^ ' "T * 

..2= of ome o,her .enomes ,and espec.anv m/humT. 

complex nucleo.de «ouence. are unLlvti 4 vSefVrT'^ 

.h... :.D .eference maps and pro.eome P^ojecJ of cTl^^^^^^^^^^ 

/'torn 1,,.. r fo/, and 5. <rre,.„,„ will b- u.e mo., H , M l^kt A/yo.. 

U-3.„..cr „r„.. 1,95: Vanbo,elen « '1 1 9« "o^r 'w ^Jt^^ 

map^ of other orjanisms will take lon»er to con„n,,,' u complete 

'peces pro,e,n ,de„t,r,ca,.on .eeh„,,ue'" m ^1 ou^™ "» 

and .mple eukaryo.e, to be pan.al.y'defned ^^^l^ Z:^/:^::::: 

B,. , , cc.„.. n„,„™ nr i x;,?^"^^; 

Specie.^ Name 



M\<npiaf 111(1 >pSLiCS 
Esrhrrirlifa cnii 
Snrrhninnnte^ Cfirmiur 

r)i( l\,niri,„,r (ilU niilrilltl 
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e..m.ed a, L,,^... ^/pr^tet^, rr™,": ."n;; r ^ " 

.0 ...ue. con,,„u„„g the •hou.ekeep.n^- prote.nMB rd v, hT °" 

ptoiem. consiiiui.n!: a set thai are soec fic m ■, .... 7 » ' remainder of 

elec.ropbore„c eond.t.on, are ref^enc/ma ^m 
.~m can be <uper,mpo5ed m -el databltr.rc JT ' 

-celerateMhedeftntt'onofthetut e ; pL^^faT:^^^^ '"'r ^^'^ 
unique to different t.s.ue tvpes Such Mu'd.e. m,? f *"°^'''™""^"'-" 
P-.-.ran.a,ional modifcauon ' wh' h can feT nTe'am^ 
different „,,ue.s. Proteins tha, remain unknown a er id "n,mcar„ '^"'i 
useful ,n providing focu, for nucleic acid sequen ^.^^ T ' 



-i: Marc R. vvilkins ei ai. 

Fl-n-RE DIREmoVS Or PROF ONit F-KOJHCTS 

Thi^ rrMsu ha> dsscribsd rsceni advance* in ih- nr.-, «f « 

acd analys.s , a. udl as th= appJ,ca.,on. o; n.u- ^^nolol ;^^^^^^ ^"^'"^ 

^cpara.,onJd.ntifica..onanda„a,?s sofc..t^^^^^^ 'T'" 
po^ibic .h. ...abLshmcn. of d.JiefS™ 

htcommc ihr method of choice for th. ^J^Uol ^T. ^^h'^h :,r. 

.nv.M,saMon of frnr rxpr«..on therem °' ^^"^^'^ -nJ th. 
Proiromr projsct^ arc alrsadv impaciins on th- rf«,»,r,n «f , 

DNA sequence constuure. the def.nLn o.C eLfitrPo ' 

Of different us.ues of . s,ng,e organism are oS nlnc^u'dT " ^ ^^"^^^ 

cross-.peces ■drmincauon of protein, (for examn f 1 *^ • Simiiurfv . 

from Cn;, J;^. bv companson l^h 5 c^^j , f Proicns 

organ..m. thai are poori> molecularlv deHned aTc ' " ^'""'^^ °" 
proceed ai . pace order, of masnnude faTic^^in?r"'^"'" "icniincation can 

defining the gene and protein comDlem^n " '"^ P'"-""' "^"''^ o( 

sequencing of senome. u-,,) b a'^Td d an j 1 °h""'T '^'^ ^Na 

nox el. ^nipha5.s.placed on those found to he 

Ju^i as genome sequencing is not an end in itself neirKrr 

reference map of an organism, nor indeed the ^1^^^^^^^^^^^ '"""^''^'^ P^O'-'n 
So Whilst an immediate aim of proteome p c^^c 1^^ o ",1^^^^^^^^^ 

maps, this will i-ad to expression studies nnTrK ^ reference 

modiricationv The chanenVr t^^dftot^^^^^^^ 

«ruciur: and /unciion of proi=,ns in a nroijnm. ri *t invesiigalion of 

in w=r= ,n,„all> of no kno. n funcoi ^ot^ ^ f'ti^^^^^ 

-..nd.„, Of ho. „>-,n, or,an..-. a. co^'Ll^tird Z^^Z:^'' 
AcknoKlfdgemenu 

.MR\^- Ihe rfcpicni of an Auscral.an Posicradualc R,-.r-,rrh j . 

IHS and KLW acknoolcdse a«,sun« f^; „,™1 -^^C- MR^*-'- 

-U,hcSw,,.N3nonalFund'orSci.Sl.^arch?r 

™"".u.. u-ho ,upp,,ed work ^i^Z^-^i:^'^^:^: .hank 

in rrwss during the u riiing of this re\ ,eu . 
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METHODOLOGY 



Human cellular protein patterns and their link to genome DXA 
sequence data: usefulness of two-dimensional 
electrophoresis and microsequencing 



BENT HONOR L' 
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ABSTRACT Analysis of cellular protein patterns by 
computer-aided 2-dimensionaI gel electrophoresis together 
with recent advances in protein sequence analysis have 
made possible the establishment of comprehensive 
2-dimcn5ional gel protein databases that may link pro- 
tein and DNA information and that offer a global ap- 
proach to the study of the cell. Using the integrated ap- 
proach offered by 2-dimensionaI gel protein databases it 
IS now possible to reveal phenotype specific protein (or 
proteins), to microsequence them, to search for homology 
with previously identified proteins, to clone the cDNAs, 
to assign partial protein sequence to genes for which the 
full DNA sequence and the chromosome location is 
known, and to study the regulatory properties and func- 
tion of groups of proteins that are coordinately expressed 
in a given biological process. Human 2-dimensiona] gel 
protein databases are becoming increasingly important in 
view of the concerted effort to map and sequence the en- 
tire genome. Celis, J. E.: Rasmussen, H. H.: Leffers, 

H.: Madsen. P.; Honore, B.; Cesser, B.; Dejgaard, K.; 
Vandckerckhove, J. Human cellular protein patterns and 
their link to genome DNA sequence data: usefulness of 
two-dimensional gel electrophoresis and microsequencine 
FASEB J. 5: 2200-2208: 1991. 

Ar; Words- numan protein patlerns • 2 -dimensional gel Orotctn 
datahasa ■ gene expression • microsequencing • cD.\'A cloning 
linking prolan and DXA mjormation • genome mapping and se- 
auencinr 



Proteins svnthesized from information contained in the 
D.X.A orchestrate most cellular functions. The total number 
oi proteins svnthesized by a lypical human cell is unknown 
aithough current estimates range from 3000 to 6000. Of 
these, as manv as 70% mav perform household functions 
and are expected to be shared bv all cell types irrespective of 
iheir oricin. There are many different cell tvpes in the hu- 
man bodv with perhaps 30.000 to 50.000 proteins expressed 
in^the organism as a whole judged from the fact that about 
i'^c oi the haploid genome correspond to genes. Todav oniv 
a small fraction of the total set of proteins has been identified, 
and little is known about the protein patterns of individual 
cell types or their variation under physiological and abnor- 
mal conditions. 

For the past 15 years, high resolution 2-dimensional gel 
electrophoresis has been the technique of choice to deter- 
mine the protein composition of a given cell ivpe and for 
monitoring changes in gene activity through quantitative 
and qualitative analysis of the thousands of proteins that or- 
chestrate various cellular functions (refs 1-6 and references 



therein). The technique originalK described by OTarrell i 
separates proteins in terms of their i.soeleciric point ^pH ai. 
molecular weight. Usuallv one choo.ses a condition ol u.- 
terest and the cell reveals the clobal protein behaviora: 
response as all detected proteins can be analvzed both 
quaiiiativelv and quantitaiiveK in relation to each other. .A; 
present, most available 2-dimensionai gel techniques (regu- 
lar gel format) can resolve between 1000 and 2000 protcin.- 
from a given mammalian cell tvpe. a number thai cor- 
responds to about 2 million base pairs of coded D.N.A. Le>- 
abundant proteins can be detected bv analyzing paniall 
purified cellular fractions. 

Two-dimensional gel ectrophoresis has been widelv applied 
to anah sis of cellular protein patterns from bacteria to mam- 
malian cells (refs 1-6. and reference.-; therein). In spite of 
much work, however, information cathered from these 
studies has not reached the scientihc community in its full- 
ness because of lack of standardized eel systems and the lack 
of means for storing and communicating protein informa- 
tion. Onlv recently because of the development of appropri- 
ate computer software (7-13), has it been possible to scar 
gels, assign numbers to individual proteins, and store tht 
wealth of information in quantitative and qualitative com- 
prehensive 2-dimensional gel protein databases (4. 14-23). 
I.e.. those containing inibrmaiion about the various proper- 
ties (physical, chemical, biological, biochemical, physiologi- 
cal, genetic, immunological, architectural, etc.) of all the 
proteins that can be detected m a gi\en cell tvpe. Such in- 
tegrated 2-dimensional gel protem^databases offer an easy 
and standardized medium in which to store and communi- 
cate protein information and pro\ ide a unique framework in 
which to focus a muhidisciplinarN approach to study the cell. 
Once a protein is identified in the database, all of the infor- 
mation accumulated can be easilv retrieved and made availa- 
ble to the researcher. In the long run, protein databases arc 
e.'cpected to foster a wide \arietv of biological information 
that mav be instrumental to researchers working in many 
areas of biology- among others, cancer and oncogene 
studies. difTereniiation. development, drug development and 
testing, genetic variation, and diagnosis of genetic and clini- 
cal diseases CFig. 1 ) 

The approach using sysicmatic 2-dimensional gel protein 
analysis has recently gained a new dimension with the ad- 
vent of techniques to microsequence major proteins recorded 
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Figure 1. Interface between partial protein sequence databases, 
comprehensive 2-dimensional eel databases, and the human ee- 
nome sequencing project. .Appropriate software is required to com- 
pare protcm and DN'A sequences In general, although the infer- 
ence of a proteins sequence from the D\A sequence (thick arrow i 
IS direct and unambiguous, the DNA sequence can only be inferred 
ipproximatelv from the protem sequence (thin arrow! and cloning 
■f the gene requires either a cDNA or the requisite group of 
iligonucleotide probes deduced from the partial amino acid se- 
uuence. Modified from ref 6. 



in the databases frefs 24-42 and references therein). Partial 
protein sequences can be used to search for protein ideniitv 
as well as to prepare specific DNA probes for cloning as-yet- 
uncharacterized proteins (Fig. 1). .As these sequences can be 
stored in the database (see for example Fig. 2H). they offer 
1 unique opportunity to link information "on proteins with 
he existing or forthcoming DXA sequence data on the hu- 
-Tian genome (Fig. 1) (20. 36. 39). 

Using the integrated approach offered by comprehensive 
2-dimensional gel databases (Fig. 1). it will be possible to 
identifv phenotype-specific proteins; microsequence them 
and store the information in the database: search for homol- 
ogy with previouslv characterized proteins; clone the 
'-'DNAs. assign partial protein sequences to genes for which 
the full DXA sequence and the chromosome location are 
known, and study the regulatory properties and function of 
','roups of proteins (pathwavs. organelles, etc.) that are coor- 
dinatelv expressed in a given biological process. Comprehen- 
sive 2-dimensional gel protein databases will depict an in- 
tegrated picture of the expression levels and properties of the 
thousands of protein components of organelles, pathways, 
and cvtoskeletal systems in both physiological and. abnormal 
conditions and are expected to lead to identification of new 
regulatory networks in different cell types and organisms. In 
the future. 2-dimensional gel protein databases mav be 
linked to each other as well as to national and international 
specialized databanks on nucleic acid and protein sequences 
protein structures. NMR experimental data, complex carbo- 
hydrates, etc. 

lew 2-dimensional gel protein databases that are accessible 
m a computer form have been published in extenso: these 
correspond to the protein-gene database of Escherichia coi, 
K-12 developed by Neidhardt and colleagues (14. 23). the rai 
REF 52 database established bv Garrels and co-workers at 
Cold Spring Harbor (18. 221 and a few human databases 
aransformed amnion ceils [15. 20). normal embrvonal lung 
MRC-5 fibroblasts [17. 21). keratinocvtes 1 19] and peripheral 
blood mononuclear cells [15]) developed in .Aarhus. Given 
space limitations and to keep this review in focus, we will 
concentrate on the computerized analysis of human cellular 
--dimensional gel patterns, and in particular on the steps in- 
volved in establishing comprehensive 2-dimensional gel 
databases that can link protein and DXA information. 



MAKIXG AXD MAXAGIXG A COMPREHEX<l\ v 
2-DIMEXSIONAL GEL DATABASE OF HIM av " 
CELLULAR PROTELXS 

The rirst step in making a comprehensive 2-dimfn<ion.i: 
protein database is to prepare a svnthetic imace i riicrrai 
oi the gel image I of the gel (riuorogram. Coomassie biuo or 
ver stained geH to be used as a standard or master rclcrt-n. l 
This can be done with laser scanners, charge couple de\ m 
(CCD)- array scanners, television cameras" rotaiin'j drun 
scanners, and multiwire chambers il3i. Computerizi~d anal- 
ysis systems for spot detection, quantitation. Dariern match- 
ing, and data handling (access and retrieval of inicniation 
database making) have been described m rhe literature 
(ELSIE [43). GELL.\B [11). HER.MeS !4-^- MELANIF 
1 10]. QUEST (9), and TVCHO [8). and som.- are .naiiabU- 
commercially (PDQUEST Protein Database Inc.. Huntin- 
ton. X.Y.; KEPLER. Large Scale Biology. Roekville. .Nuf 
V isage. Biolmage Corporation. Ann Arbor. .Mich.: Gemini 
Joyce Loebl. Gateshead: Microscan 1000. Teehnoloijv 
Resources Inc., Nashville, Tenn. and .MastcrScan. BiUenca. 
Mass.). Unfortunately, most of these systems are incompati- 
ble with one another and their advantages and disadvantaeev 
have been discussed by Miller (13). 

In our work station in .Aarhus. fluorograms arc scanned 
with a .Molecular Dynamics laser scanner and the data are 
analyzed using the PDQUEST II software (Protein Data- 
bases Inc.) (12) running on a spark station computer 4100 
FC-8-P3 from SUN Microsystems. Inc. The scanner meas- 
ures intensity in the range of 0-2.0 absorbance. A typical 
scan of a 17 x 17 cm fluorogram takes about 2 min. Steps 
in image analysis include: initial smoothing, background 
substraction. final smoothing, spot detection, and fitting of 
ideal Gaussian distribution to spot centers. Spot intensity is 
calculated as the integration of a fitted Gaussian. If calibra- 
tion strips containing individual segments of a known 
amount of radioactivity are used, it is possible to merge mul- 
tiple exposures of the sample image into a single data image 
of greater dynamic range. Once the synthetic image is 
created it can be stored on disk and displayed directly on the 
monitor. Functions that can be used to edit the images in- 
clude: cancel (for example, to erase scratches that may have 
been interpreted as spots by the computer: cancel streaks or 
low dpm spots), combine (sometimes a spot mav be resolved 
into several closely packed spots), restore, uncombine and 
add spot to the gel. The process is time consuming - about 
1-1/2 day per image. Edited standard images can be matched 
to other svnthetic images. Figure 2.4 shows a portion of a 
standard svnthetic image (IFF) of a Huorogram of 
I Sjmethionine labeled cellular proteins from human .WW 
cells (master database I (20). Images can be displayed cither 
in black and white (resembling the original Huorograms) or 
in color (Other images in Fig. 2). depending on the^necd. As 
shown in Fig. 2B, each polypeptide is assigned a number by 
the computer, which facilitates the entry and retrieval of 
qualitative and quantitative information for anv given spot 
in the gel (20). The standard image can be matched auto- 
matically bv the computer to other standard or reference gels 
(Fig. 2C. matching of .AMA cellular proteins [left] to .MRC-") 
proteins (rieht)) provided a few landmark spfxs are given 
manually as reference (indicated with a - in Fig. 2C) to in- 
itiate the process. 



vAbbreviaiions: CCD. charge rouplt: device: PC.N.A. proliferai- 
ing cell nuclear antigen; HPLC. hich perlormanee liquid chromaiog- 
raph\: 




Minv labeled pro.eins from normal human MRcTfiLbfas TK f ' "■^'"""""^P'^ °' - '-"'-n of ;.n !EF liunru.ran, oi ( 33S)mc->h,o- 

/-> Th. funcnon peruse annotation for ^spo^a owsTh operf ^^^^^^^^ 'f" ""'^^ ,.v,olv„c p.n.hwav. 

(7. Relat.ve abundance of cv.oskele.al and cvt^r^l ,a,X,cd o ot "ns ""'"T '"f"^— " - -'-^bi.- lor ,-, eiven pr-cn. 

i^->- Polvpep,.des ,ha. con,a,n .nforma.on ^^^l^^^tlZ::";::^::::::- ^^^^^^^ ^,^0-3 Hbr-b- 
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The automatic matching process that has been described 
in detail by Garreis et al. (12) takes about 5 min. Matched 
proteins are indicated with tlie same letters in both gels (Fig. 
2C). The usefulness of this function is emphasized by the fact 
that data accumulated on common household- proteins can 
be easily transferred to any other human cellular cell type 
whose 2-dimensional gel cellular protein pattern is matched 



to our standard AMA 2-dimensional gel protein image. Al- 
ternatively, if the standard gel is pan of a matchset (set of 
gels in a given experiment) it can be used as a linker gel to 
compare, for example, the quantitative values of a given pro- 
tein throughout the experiment (see Fig. 2D; levels of some 
proteins in normal and SV40 transformed human MRC-5 
fibroblasts) or with other standard images in different sets of 



tip DPOTC>\ OJT 



-PNC 



cross-matched expcrimenis (18, 22). 

Once a standard map of a given protein sampJe is made 
one can enter qualitative annotations to make a reference 
database Our master 2-dimensionaJ gel database of trans- 
lormed human amnion cell (AMA) proteins (20) lists 3430 
polypeptides of which 2592 correspond to cellular compo- 
nents, having pi's ranging from 4 to 13 and molecular 

t'em^nrh'T^K^ ' ^"'^ "^^^ abundant pro- 

teins in the database correspond to total actm (3.87% of total 
protein, about 90 million molecules per cell) while the 
esser abundant of the recorded polypeptides are present m 
the vicmuy of 5000 molecules per cell. Some aJlnotat on 
categories we are usmg to establish the master AMA data- 
base include: 7) protein identification (comigration with 
purified proteins, 2-dimensional immunoblottine micZt 
quencmg) P) amounts (total amounts and leVels of svnthe 
sis), 3) subcellular localization (nuclear, cvtoskeletal mem- 
brane, menibranc receptors, specific organelles, etc.)- 4) 
antibodies; 5) posttranslational modifications (phosphorvla- 
tion glycosylat^on. methylation etc.); 6) microsequencmg; 7) 
cell cycle specificity (specific variations in levels of svnthesis 
and amount); 8) regulatory behavior (effect of hormones 
growth factors heat shock, etc.) 9) rate of svnthesis m nor- 
mal and transformed cells (proliferation sensitive proteins 
eel cycle specific proteins, oncogenes, components of the 
pathway (or pathways) that control cell proliferation)- JO) 

fZlTn^^nT'' p'rotems o^kiown^ 

function). J J) sets of proteins that are coordinatelv regulated 

e lis e7c r /P^ n^.°i'' «P-ssion in SrKus 

cells etc.); 12) cDNAs (cloned cDNAs); ij) proteins that are 

Zn '"f\rZ '""^^ (^y^-n^atic comparison of protein 
d.vi7"t f P'-.°'-i"^ fron> heaJthf and diseased Z 

cDNAs A nl"r""\"?-' «P'°"^"°" °f transfected 
CDNAs, 75) pa^ways (metabolic, others); 75) gene localization 

in nl^ of microinjected antSLdy 

on patterns of protem synthesis; and JS) secreted protems 

cate"J;'"'"°l '""'f ^ annotat on 

^ u ""^-^ '"""'"^'^ ^^'''"g ^he computer to 

d splay the information on the color screen. For example 

l^AA^ ^ synthetic image of a NEPHGE gel (master 

AMA database) displaying the information contafned^nde 
the entr>- glycolytic pathway Alternatively, One can use the 
function peruse annotations for spot to directlv ask the ^nn^ 
puter to list all the entries available for a panL ar J otcm" 

n ietal human tissues) ,t is possible to take a quick look at 
the information m that panicular entry (Fig 27^ 

A major obstacle encountered in building comprehensive 
numb^^'r^ '^^'^^^^^ .den^ifving'the Ta ; 

SatTbasesl20"2rri "^'""'"^ ^" °- 

. rn^l ■ ' V' proteins are identified by one or 

a combination of the following procedures: 7) comigration 

u nV7""r"""K^i-^-'''^ immunoblotting. 
using specific antibodies, and 3) microsequencmg of 

f^oTn^''^ ^o'?"' ^^"^ ""'"^'^ P^tcins recovered 

from dried 2-dimens.onal gels (see next section) Protem 

s indfv?d'n """^ °f-icrosequencingmay bi dl^cult 
d f^.rl^ P'"°'''" of families with shon peptid^ 

differences may escape detection. In the gene-protem data- 

dTJL "'-.''m ' -ajor'2-di:Sen;ional7el 
a w?der r r ' P^""'' P^°"'"^ ^« being identified by 

pro ems 3.? '"-'"^^ -o-^is-tion with purified 

proteins, genetic criterion (deletion, insertion, frameshift 
non ense missense, regulatory), plasmid-bearing s a n^ 
and m vitro synthesis of protein; selective labeling (methyla 
t.on phosphorylation); peptide map similarity; and pTvIio- 
logical criterion and selective derivatization 



by 2^im?L:r^t;i:^;^L^[:;:^;--:--- 

= se^S?S^r-^t~ 

available all the mfonnat.on we mav have accumulated on tha: 
partjcu ar protein. For example. Table 1 hsts entries avaUa 

V. VAC-o. endonexin II. renoconin. chromobindm-5' an- 
ITate ^^P^°b.ndin. 

As mentioned previously one distinct advantage of 
2-dimensional gel electrophoresis is the possibilitv of sfudv- 

^re sid clo i"'!?" "^^'■^"P^ of proteins that are e.x- 

pressed coordinately during a given biological process 

lack of published data on global cellular protein patterns We 

edsThat " "'"'k,'"^. difficulties in obtaining sets of 
gels that are suitable for computer analvsis (streakine 

nons (laborious editing time, need of calibration strips to 
merge .mages, limited dynamic range, etc.) in the computer 

advanced quantitative studies published so far using com- 

workertrfs-'oo r"'' ^^^'^ 

workers (18, 22). In panicular. these investigators have estab- 

ished a quantitative rat protein database (18. 22) designed 

t.m:fa'tifnT:nd°""°V^P^°'''"^"°"- ^^^'^ mhi^it^A.^and 
stimulation) and transformation m well-defined groups of 

ce« lines obtained by transformation of rat REF52 cellslith 
Thei .r "^""'T'' ^"'^ '"^""^ "-■^0"'^ virus, 

r^tressed h"- ^'"""^ of proteins induced or 

rr?n!fn^ '"^ ^'""''''^ '° confluence as well as groups of 

tiaj tashion to transformation by DNA and RNA viruses A 
most interesting feature of this quantitative database is the 
lar'.vn'^ °^ coregulated proteins that show simi- 

a lorT n ''f ^y^'-^-fulated DNA repli- 
(pKc/in ""'^'"^ 

car^ed"o„r^'" •'^^'^•''^"^ quantitations have been 

To VDeondL h 'he radioactivny contained in the 

pol> peptides by direct counting of the gel pieces in a scintil- 

hr^an ''^ '° P^°'^'- be out 

oarable fJ^T^""' "PT^" ^'"'^ ^ P"'"'^ com- 
parable to tha, required for editing a synthetic image. 

^^thou, "'^!?" '^^^^ """'^^ °^^P°" - difficult 

without the assistance of a master reference image and a 

numbenng svstem tha, can be used to identify the spots. Us- 
ng ,his approach, we have recorded quantitative changes in 
he relative abundance of 592 ['^SJmethionine-labeled'pro 
eins synthesized by quiescent, proliferating, and SV40 
transformed human embryonic lung MRC-5 fibroblasts (21) 
borne data concerning cvtoskeletal and cv,oskeletal-related 
tho ."nf r P;««=nted in Fig. 2G. Our studies as well as 
those of Carrels and co-workers (18, 22) may in the long run 
help define patterns of gene expression that are characteristic 
of the transformed state. 
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datIEasr.°"'^^Kr'°"''- ^""^ 2-dimensional gel 

databases available in computer form tha, have been pub- 
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TABLE 1. Somr fnrncs for kpocornn V ,r. the humcr. A.\tA 2 -dimensional gfl prolan iaiabast 



Knirii:! for iipoconin \' iIEF SSP S2lD' 



iniorinaiion entered 



1 Protein name 

Percentage of total proton 

) Apparent molecular weight imri 

4 Isoelectric point (pi) 

r Method (or methods i of identification 

6 Orcdit to invcsticators that aided in 
identification 

7. Antibodv against protein 

H. Comi^ration with human proteins 

9 Cellular localization 

HI Calcium/phosphoiipid-ciepcndent 
membrane proteins 

I I . Function 



Lipocortin W renocortin. chromobindm-5 . endonexin I. anticoat:uiant protei- 
PAP-I. \AC-Q. 35-)-calcimedin. IBC. calphobindin I. anchonn CM. annex::- \ 
O.nOCr labout 2.800.000 moiecules per cell) 

33.3 kDa 
4.76 

Microsequcncing 2-dimensionaJ immunoblottmg Comieraiion 

G. Bauw. J. \-andekerckhove. and colleaeues. Riiksuniversiteii G^ n:: K !\ i,i:ukn 
BIOGEX. Cambridge; .N.G. .^hn. Universit\ of Washincton 

Polvclonal (rabbit, antibodv no. 201. B. Pepinskv . BIOGEN. Cimbruice 

Lipocortin \-.N.G. Ahn. Howard Hughes Medical Institute. \Va.<hini;toii l nn, :MiN 

Subcortical membrane 

Lipocortin \' 

Rcgulauon of various aspects of inflammation, immune respoiiM-. blnoj u..iLMii.ii i, 
and difi"ercntiation 



12. P. irii.il amino acid sequence 

1 .) cD.N.A sequence 

14. Le\ els in letal human tissues 



1.) Li'vels in cjuiescent. proIifer;itin<;. and 
trnnslormed .\IRC-3 fibroblasts 

Ml IDistribution in Triton supernatant and 
< vioskeletons 



GTXTDFPGFDER (7-I8V VLTEII.ASR ,10P-1]7V QX VEEEVGSSLEDinAC; 
(127-143). i-GTOEEKFITIFCTCR) (187-201) 

Known. R, Blake et al../ Bwi Chan 263. 10799-IOSll. 19S8 
( pi = 4./ 6 trom translated sequence! 

.Adrenal glands = - - . ; brain = - - _ ; 

cerebellum = - - - ; ear = - - - ; eve = - - , 

heart = - • _ ; hvpophvsis = * * - ; liver = » * - ; 

lung = - - - ; meninges = ~ ~ - : 

mesonephric tissue = * - ; 

striated muscle = ~ ^ . ■ pancreas = - - - , 

skin ■= T - + ; spleen = - - - ; stomach = - _ * ; 

submandibular gland = - - - ; 

small intestine = - - - ; thvmus = - - - . 

thvroid gland = - - - ; tongue = - - - ; 

ureter = - - - 



Qiquiescent) = I.l: P (proliferating) 
T (S\'40 transformed) = 0.3 

-Mainlv supernatant 



1 .0; 



iishfd in extcnso: these correspond to the E. coli K-12 
proiein-ijene database (14. 23) and to the rat REF52 data- 
Ijase (18. 22). 

Tin- £. coll K-12 cellular protein-gene database is perhaps 
rla- most complete of all databases reported so far and even- 
iiialK It should trace each protein back to its structural gene. 
Inlormation contained in this database includes: gene/pro- 
t(.in name (protein name. EC number, gene name); 
2-diincnsional gel spot designations (.\-y coordinates from 
letcrcnce gels, alphanumeric designation); genetic informa- 
non (linkage map location, physical map location. Genebank 
I ode. sequence reference, location on Kohara clones); bi- 
'uhemical information (molecular weight, pi. number of 
residues of each amino acid. nr»plc percent of each amino 
acid, total number of amino acids in a polvpepiide), and 
regulatory intbrmaiion (cellular level of protein in different 
media and different temperature, member of reguten. mem- 
ber of stimulon). Major advances of this database are en- 
visaged in the future in view of the eminent sequencing of 



the whole £. colt genome as well as the development of im- 
proved methods to express cloned genes. 

The rat REF52 2-dimensional gel protein database lists 
about 1600 proteins that have been recorded using the 
analysis system (18, 22). Included in this quantita- 
tive database are 1) protein names (cytoskeletal and heat 
shock proteins as well as various nuclear, mitochondrial, and 
cN'toplasmic proteins). 2) annotations i.subctllulur localiza- 
tion, modification, recognition b\ specific antibodies, 
coprecipitation, NH;-terminal sequence, cross- relerente to 
protein sequence information and rcierenccs to the litera- 
ture). J) protein sets (cytoskeletal proteins. phos[)hoproieins. 
sets of proteins with PCN.A/cyclin-likc properties, etc.) and 
4) general quantitative data (protein synthesis ciurmg growth 
of normal REF52 cells to confluence and quiescence, and af- 
ter restimulation of growth-inhibited cell.s). 

In addition to the 2-dimensional gel databases mentioned 
so far there arc several smaller cellular databa.ses being es- 
tablished in human (normal human dipluid librobla.sts, lym- 



phocytes. leukocytes, leukemic cells) mouse (N'IH/3T3 cells, 
T lymphocytes). Aphsia. yeast {Saccharomyces cernisae). plants 
(wheat, barley sorghum), and Euglena. Databases of tissue 
protein, fbrain. whole mouse, liver) and bodv fluid proteins 
(plasma protems. cerebrospinal fluid, urine, and milk) are 
bemg established in several laboratories. The reader is 
directed to the review by Cclis et al. (4) for details and refer- 
ences concerning these databases. 



MICROSEQUENCING H.\S .\DDED .a. XEW 
DIMENSION TO COMPREHENSIVE 
2-DIMENSION.\L GEL D.ATAB.\SES: DIRECT 
LINK BETWEEN PROTEINS AND GENES 

The development of highly sensitive amino acid gas-phase or 
liquid-phase sequenators (24). together with the establish- 
ment of efficient protein and peptide sample preparation 
methods, has opened the possibility to perform a svstematic 
sequence analysis of proteins resolved bv 2-dimensional gel 
electrophoresis. Indeed, generated pieces of protein se- 
quences can be used to search for protein identity (compari- 
son with available sequences stored in databanks) as well as 
for preparing specific D.N.A probes for cloning of as yet un- 
characterized proteins (Fig. 1). In addition, partial protein 
sequences can be stored in 2-dimensional gel databases (for 
example, see Fig. 2H) and offer a unique link between pro- 
teins and genes (Fig. 1). 

In the early 1970s gel electrophoresis was used to purifv 
proteins for sequencing purposes (reviewed bv Weber and 
Osborn in ref25). Proteins were recovered by diffusion and 
sequenced by the manual dansyl-Edman degradation at the 
nanomole level. This technique was further refined bv using 
eleciro-elution to recover proteins and by miniaturizing the 
system (25). This method has been used extensively but 
showed increasing drawbacks (low yields, protein samples 
contaminated by free amino acids, and NH;-terminal block- 
mg) as the amounts of handled protein graduallv became 
smaller (e.g., at the 10 picomol level). 

Most of the problems referred to above have been 
minimized with the introduction of protein-electroblotiine 
procedures (27-32). When proteins are blotted on chemi- 
cally inert membranes, it is possible to sequence the immobi- 
lized proteins direcilv without additional manipulations. 
Thus, depending on the amount of bound protein and its na- 
ture, this direct sequencing procedure generallv vields NH.- 
lerminal sequences containing 10-40 residues. As such this 
technique was used to identify bv their NH,-terminal se- 
quences, differentially expressed major proteins from total 
cellular extracts separated on 2-dimensional gels. A major 
difficulty encountered in this procedure is the occurrence of 
Irequent artefactual blockage of the proteins. Several studies 
suggest that this phenomenon is mainlv due to reaction with 
contaminants (particularly unpolvmerized acrvlamide 
present in the gel) and to a high dilution of the protein (low 
concentration of the protein per unit membrane surface). In 
addition to this primarily technical problem, manv proteins 
are blocked in vivo by acylation or bv a pvrrolidon carboxvlic 
acid cap. 

The problem of partial or complete NH;-ierminal block- 
age can be circumvenied by generating internal amino acid 
sequences. This is achieved by fragmenting the protein 
present m the gel (gel in situ cleavage) or bv cleaving it while 
bound to the membrane (membrane in situ cleavage) 
(3j-3o). In both cases, proteins are either cleaved in a res- 
tricted way (e.g., by limited enzymatic digestion or bv using 
restriction chemical cleavage conditions) or fragmented into 
smaller peptides. 
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Of -the different combinations e.xamined we ha- 
results bv using exhaustive proteolvtic dicesMo--'^"'" ' 
membrane-immobilized proteins. This method ha/'br--'' 
descnbed for Ponceau red-stained proteins on nnroceiiui.V. 
blots (j4). for Amido-black^tained Immobilon-bound 
teins. and for fluorescamine"^deiected proteins on ebs< htv 
membranes (35). The proteases used (trvpsin. chv^oir\ d<,, 
or pepsin) cleave at multiple sites, ceneratine small pepudv- 
that elute Irom the blot into the dieestion bJffer from which 
they are purified bv reversed-phase hieh performance liqu.d 
chromatography (HPLC) before beinc sequenced indiN-idu- 
ally .Although each of these manipulations could be c\pecied 
to result m a reduced vield of final sequence information wr 
were surprised that the peptides could be sequenced with 
high efficiency In our hands, this approach could be rou- 
tinely applied to gel-purified proteins available in amount- 
ranging from o to 10 Mg. and often v.elded sequence informa- 
tion covering more than SOfc of the total protein 
membrane-immobilized proteins are not homogeneouslv 
digested, but rather show protease sensitivitv ne.xt to resis- 
tant regions, the number of peptides generated is much lower 
than expected from the number of potential cleavage sites 
Consequently HPLC peptide chromatosrrams arc less com- 
plex and most peptides can be recovered in pure form. 

.As only limited amounts of a protein mixture can bt- 
loaded on a 2-dimensional gel. proteins of interest are often 
obtained in yields insufficient for the currentlv available se- 
quencing technology-. More material can be obtained bv en- 
riching for a certain subcellular fraction (purified cell or- 
ganelles) or bv exploiting affiniiv (dves. metals, drugs etc) or 
hvdrophobic properties of proteins before gel analvsis All of 
the sequencing results accumulated so far in the human pro- 
tein database (20) (a few are shown in Fig. 2H) have been 
obtained from analvsis of protein spots collected from 
^-dimensional gels that had been stained with Coomassie 
blue according to standard procedures and dried for storage 
Proteins are recovered from the collected gel pieces bv a 
protein-elution-concentration device, combined with gel 
electrophoresis and electroblottmg. Details of this technique 
have been reported in a previous communication (42) and a 
brief outline is given below. 

Combined gel pieces are allowed to swell in gel sample 
buffer (a total volume of 1.5 ml). The eel pieces combined 
with the supernatant are then collected into a large slot made 
in a new gel. The slot is further filled with Sephadex G-IO 
equilibrated in gel sample buffer During consecutive gel 
electrophoresis, most of the electrical current passes on the 
side of the slot instead of passing through the slot. This 
results in both a vertical stacking and horizontal contraction 
o the protein band. With this device the protein is efficiently 
eluted from the gel pieces and concentrated from a large 
volume into a narrow spot. The highlv concentrated (about 
n mm^) protein spot is then electroblottcd on PVDF- 
mcmbranes. stained with Amido black, and in situ digested 
with trypsin. The peptides generated during digestion clute 
from the membrane into the supernatant, and can be sepa- 
rated bv narrow bore reversed-phase HPLC and collected in- 
dividually for sequence analvsis. 

Lsing this and previous procedures (37, 39, 42), we have 
so far analvzed 70 protein spots collected from 
2-dimensional gels (20, and unpublished observations) (see 
o'^nn^T'"'^ sequence information amounts to 

-100 allocated residues corresponding to an average of 30 
residues per protein spot. So far we have made cDNAs of 
manv of the unknown proteins thai have been microsc- 
quenced. and a substantial number has been cloned and se- 
quenced. All available information indicates that it may be 
possible to obtain partial sequence information from most of 
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the proteins that can be visualized by Coomassie Briitant 
Blue staining. 

Partial protein sequences are stored in the database as dis- 
played in Fig. 2H. and it should be possible in the near fu- 
ture to interface this information with forthcoming D.N'A se- 
quence data from the human genome project. In the long 
run. as the human genome sequences become a\ailable it 
will be possible to assign partial protein sequences to genes 
:or which the full DN.-\ sequence and chromosomal location 
;ire known (Fig. 1). 

SUMMARY 

The studies presented in this brief review are intended to 
demonstrate the usefulness of computer-aided 2-dimensional 
gel electrophoresis and microsequencing to analvze cellular 
protein patterns, and to link protein and DNA information. 
As more information is gathered worldwide, comprehensive 
iatabases will depict an integrated picture of the e.vpression 
lev els and properties of the thousands of proteins that orches- 
trate most cellular functions. 

ClearK . databases allow easy access to a large body of data 
and provide an efficient medium to communicate stan- 
dardized protein information. In the future, databases will 
foster a wide variety of biological information that can be 
used to support collaborati\e research projects in basic and 
applied biology as well as in clinical research (2. 5. 46). Once 
a protein is identified in a particular database all the infor- 
iiation gathered on it can be made available to the scientist. 
!-lowever. many problems must be soKed before protein 
databases become of general use to the scientific communit\. 
.A most urgent one is to promote standardization of the gel 
running conditions so ihat data produced in a given labora- 
tory may be used worldwide. Surprisingh'. the gel running 
technology' as it stands today is still a craftmanship art. 

Finally, comprehensive, computerized databases of pro- 
teins, together with recently developed techniques to 
microsequence proteins, offer a new dimension to the studv 
of genome organization and function (Fig. 1). In particular, 
human protein databases may become increasinglv impor- 
tant in view of the concerted effort to map and sequence the 
entire human genome. This formidable task is expected to 
dominate biological research in the ne.xt decades. 

VN'e would like to thank S Himmelstrup Jorgenstn ior tvpini; the 
manuscript and O. Sondcrskov for photograph\ Work in ihc 
juthor.';' laboratories was supported bv grants from the Danish Bi- 
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Reference points for coriparnoni of ;-D fe! rajrs ?">c 

Reference points for comparisons of tHo-dimensionai 
maps of proteins from different human cell types 
defined in a pH scale wliere isoelectric points correlaie 
with polypeptide compositions 

A highly reproducible, commercial and nonlinear, u ide-range immobilized pH 
gradient (IPG) was used to generate two-dimensional {2-D\ eel maps of 
rS]methionme-labeled proteins from noncultured. unfracrionaied normal 
human epidermal keratinocytes. Forty one proteins, common to most human 
cell types and recorded m the human keratinocvte 2-D eel protein database 
were identified in the 2-D gel maps and their isoelectric points ^p/) were deter- 
mined using narrow-range IPGs. The latter established a pH scale that 
allowed comparisons between 2-D gel maps generated either with other IPGs 
in the rirst dimension or with difTerent human protein samples. Of the 41 pro- 
teins Identified, a subset of 18 was defined as suitable to evaluate the correla- 
tion between calculated and e.xperimental p/ values for poivpeptides with 
known composition. The variance calculated for the discrepancies between cal- 
culated and e.xpenmental p/ values for these proteins was 0.001 pH units 
Comparison of the values by the /-test for dependent samples (paired test) 
gave a /j-level of 0.49. indicating that there is no sisnificant difference between 
the calculated and experimental p/ values. The precision of the calculated 
values depended on the buffer capacity of the proteins, and on average it 
improved with increased buffer capacity. As shown here, the widelv avai'lablc 
information on protein sequences cannot, a priori, be assumed to be sutTicieni 
tor calculating p/ values because posi-translational modifications, in particular 
/\-ierminat blockage, pose a major problem. Of the 36 proteins analvzed m 
this study. 18-20 were found to be .V-terminallv blocked and of these onlv 6 
were indicated as such in databases. The probabilitv of A-ierminal blockage 
depended on the nature of the .V-terminal group. Tweniv six of the proteins 
had either M. S or A as .V-terminal amino acids and of these 17-19 were 
blocked. Only 1 in 10 proteins containing other \-ierminal croups were 
blocked. 



1 Introduction 

As compared with carrier ampholyte isoelectric focusinc 
'C.A-IEF). the application of immobilized pH gradients 
(IPGs) in the firsi dimension in 2-D gel electrophoresis 
offers improved reproducibility [1] because the nature of 
the pH gradient makes the resulimg focusinc positions 
insensitive lo the focusini: time [2] and to the type of 
sample applied [3]. The recently introduced ready-made 
IPG strips [4] seem to be an ideal substitute for the car- 
rier ampholyte gradients, which until now have been the 
most commonly used first dimensions in 2-D gel electro- 
phoresis. The availability of standardized first dimen- 
sions opens the possibility of comparing 2-D gel maps of 
various cell types generated in different laboratories, pro- 
vided that the focusing positions of a number of easily 
recognizable polypeptide spots common to the cell types 
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in question are known. Even though this approach is 
limited to experiments performed \Mih the same standar- 
dized IPG. the fle,\ibilit\ provided by IPGs allow.s the 
pH gradient to be adjusted to the requirements of a par- 
ticular e.xperiment. 

Exchange and communicaiion of 2-D gel protein data re- 
quires a pH scale that is independent of the particular 
IPG used and by which the results can be described. Tlie 
introduction of carbamylaiion trains and the relation of 
focusing positions to the spots in these trains repre- 
sented a step forward towards solving the reproducibility 
problem experienced with carrier ampholyte focusing [5]. 
Problems associated with the use of carbamylaiion trains 
were mainly due to lack of temperature control and to 
the use of nonequilibrium focusing conditions. Accord- 
ingly, the pattern variation involved not only the re- 
sulting pH gradients, but also the relative spot positions 
as related to each other and to spots in the carbamyla- 
tion trains. Even though the question of reproducibility 
has. to a large extent, been solved, the carbamylation 
trains are still not ideal as markers because the spots in 
the trains do not represent defined entities but rather a 
large number of differently carbamylated peptides 
having close p/ values. As a result, the spots are large 
and poorly defined as compared to the ordinary polypep- 
tide spots in 2-D gel maps. 
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Neidhardt eial [6] defined the pH gradient in 2-D gel 
experiments by p/ markers whose p/ values were calcu- 
lated from the amino acid composition. Focusing posi- 
tions of other polypeptides could be predicted from their 
composition but the pA' values needed for the p/ calcula- 
tions were unknown. Various groups employing this 
approach do not use the same pK values [6. 7] and there- 
fore, the p/ values derived in this way cannot be 
expected to describe the variation of the hydrogen ion 
activity. In spite of this fact, it is still possible to make 
approximate predictions of focusing positions because 
the \>K values used to defme the pH gradient are also 
used to calculate p/ values and to predict the focusing 
positions. Errors in pA" assignments are therefore com- 
pensated. A pH scale which corretly reflects the variation 
in hydrogen ion activity during focusing should improve 
the precision of the predictions, but this has never been 
implemented with CA-IEF focusing as a first dimension 
in 2-D gel electrophoresis. The main reason for this are 
the problems associated with pH measurements in 
focused gels containing high concentrations of urea. 

IPGs can be described from the concentration variation 
of the immobilized groups, provided that the pA' values 
of these groups are known for the conditions prevailing 
during focusing. To avoid measurements on gels. Gia- 
nazza eial [8] suggested the use of pA* values derived by 
addition of determined pA' shifts. Recently, direct deter- 
minations of pA* differences between immobilized 
groups in IPGs were made by determining p/-pA' values 
in overlapping narrow-range IPGs [9, 10] and the results 
verified the applicability of the Gianazza approach. A 
description of the focusing results in a pH scale, which 
correctly describes the variation of the hydrogen ion 
activity for the focusing conditions used, not only allows 
the comparison of 2-D gel maps generated with different 
IPGs, but also opens the possibility for correlating the 
focusing position of a polypeptide with its composition 
[9). Experiments by Bjellqvist etal. [9. 10] have implied 
that pH scales showing good correlation between calcu- 
lated and experimental p/ values can be derived for any 
of the conditions commonly used for focusing in connec- 
tion with 2-D gel electrophoresis. These pH scales are 
then defined through the pA^ values of the immobilized 
groups in the IPG containing gel. To be useful for inier- 
laboraiory comparisons, however, the pH scale has to be 
defined through p/ values of easily recognizable spots 
present in the 2-D gel map. So far. p/ determinations in 
a useful pH scale, combined with determinations of pK 
values needed for p/ calculations, have only been made 
for the pH range 4.5-6.5 at lO^C |9). CA-IEF focusing as 
described by O'Farrell [11] does not control the tempera- 
ture of the first dimension, which can be expected to be 
slightly above room temperature. With IPGs, the temper- 
ature commonly used is about 20°C 14, 12] or 25 °C [13] 
and this is a critical parameter that needs to be con- 
trolled [14]. 

The present work was designed to compare 2-D gel maps 
of different cell types in a laboratorv' applying both 
CA-IEF and IPG focusing at a common temperature. To 
this end we have generated 2-D gel maps of proteins 
from noncultured, unfraciionated normal human epi- 
dermal keratinocytes with IPG in the first dimension 



and a focusing temperature or:5 C. We have used cor^- 
mcrcial nonlinear, wide-range IPG strips which gi\e l-D 
gel maps thai are closely similar to the ones resul-.mi: 
with the C.A-IEF technique used to establish the human 
keratinocyte database [15]. .^s an miiial step towards 
inieriaborator>- comparisons of results obtained with the 
nonlinear gradient as a first dimension we report here 
on the focusing positions of 4! known proteins that are 
common to most human cell types. The pH range 
covered corresponds to the range in classical C.A-IEF 
2-D gel electrophoresis and in order to use these pro- 
teins as internal standards for comparing 2-D gel maps 
generated with other IPGs we determined their p/ values 
with narrow-range IPGs in the first dimension. We have 
compared the calculated \crsus experimental p/ values 
and show that it is necessan. to have further information 
(absence or presence and nature of postiranslational 
modifications), in addition to amino acid composition to 
be able to calculate p/ values that correspond to the 
actual experimental values. The pA values used for the 
calculations are provided and the usefulness of pV predic- 
tion in relation to database information is discussed. 
Furthermore, we comment on the possibility of usmg 
experimentally determined p/ values to verifv the avail- 
able database information on polypeptide composiiion. 

2 Materials and methods 

2.1 Apparatus and chemicals 

Equipment for isoelectric focusini: and horizontal SDS 
electrophoresis fMuliiphor' 11 electrophoresis chamber, 
Immobiline' strip tray. Muitidrive .XL programmable 
power supply. .Macrodrive power supplx and Muliitemp'' 
II) was from Pharmacia LK.B Biotechnology .AB 
(Uppsala, Sweden!. Vertical second-dimcnsional gels 
were run in the home-made equipment described in [15]. 
TTie IPG strips with the wide-range nonlinear pH gra- 
dient were either Immobiline DryStrip' pH 3—10 NL, 
180 mm or aliernatively 160 mm lent: IPG strips with a 
corresponding pH gradient. In both cases the IPG strips 
were delivered by Pharmacia LKB Immobiline, Pharma- 
lyte. .A.mpholine. GelBond as well as PAG film and the 
ready-made horizontal SDS eels (ExcelGeP XL SDS 
12—141 were also from Pharmacia LKB. Purified proteins 
and peptides were from Sigma (St. Louis. MO). 

2.2 Sample preparation 

Preparation and labeling of unfractionated keratinocytes 
as well as fibroblasts have been described in [16). Cells 
were iysed in a solution containing 9.8 m urea, 2% w/v 
NP-40, 100 mM DTT and v/v Ampholine pH 7-9. 

2.3 2-D gel electrophoresis 

First-dimensional focusing was performed according to 
Gbrg eiai. [2] with some minor modifications, as de- 
scribed in [9]. Rehydration of the IPG strips was made 
in a solution containing 9.8 m urea. 2% w/v CHAPS. 10 
mM DTT and 2% v/v carrier ampholyte mixture. The car- 
rier ampholyte mixture consisted of 2 parts Pharmalyie 
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ilf's ^, P!":^'"PhoI'ne pH 6-8 and 1 pan Pharmalvte 
PH 8-10.2. Lsually. cathodic sample application was 
used and the samples were diluted 2-20 times in a solu- 
tion coniainmg 9.8 v urea. 4^o w/v CHA.PS \H w/v 
DTT and 35 m.M Tris base. For acidic application, the 
rns-base was substituted with 100 m.M acetic acid The 
degree of dilution and sample volume (20-100 uL) 
depenaed on the particular sample and the IPG and 
wnether visualization of the proteins was to be done bv 
Coomassie Brilliant Blue or silver siaininE. With the 
^^«de-range non-linear IPG. 10-30 ug of total protein 
>*as loaded for silver staining and 100-200 ue for Coo- 
massie staining. Focusing was done overniehit with Vh 
producis in the range of 45-60 kVh with 160 mm long 
strips and ^0-70 kVh with ISO mm long strips. Solubili- 
zation of polypeptides and blocking of-SH groups prior 
to the second-dimensional run. as well as loa'dine on the 
second-dimensional gel was done as described in [9] 
ITie Slacking gel was omitted and 5-10 mm were left at' 
the top of the second-dimensional gel for applvinc the 
II O strip. The space was filled with electrode buffer con- 
taining 0.>Oo w/v agarose. Casting, runnina. stainine and 
auioraaiography were carried out as described in" [15]. 

2.4 Experimental determination of pi values 

The dciermmaiion of the pA' dilTerences between Immo- 
biiines pA 4.6. pA' 6.2 and pA' 7.0 necessarv for the cali- 
bration of the pH scale at 25 C m 9.8 m urea was done 
as deserved in [9] with the same narrow-ranee IPGs 
he pH scale was defined by setting the pA" "value of 
Immonihnc pA' 4.6 equal lo 4.6! [9] and ihe determined 
PA difiTcrcnccs gave the pA" values of Immobilines pA-6 ^ 
and pA .0. equal to 5.73 and 6.54. respectivelv. The pA' 
Uillcrcnccs lound are in good agreement with values de- 
rived Irom (17) and [8] by extrapolation to 9 8 m urea 
conccntraiion. .As in [9]. additional narrow-ranue recipes 
have ncen used tor determining p/ values. With narrow- 
ranLic If Gs e.xicnding lo pH values higher than the pA' 
\alue 01 Immobiline pA' 7.0. anodic sample appiicanon 
uas u..cd with acetic acid added to the sample solution 
utncrwisc. cathodic sample application was used with 
ihc same sample buffer as for wide-range IPGs 



- Protein composilions u.sed for p/ calculations 

\\ nh [he exception of vimentin. protein composilions 
h"' T Swiss-Prot database [18). For vimentin. we 
used the data from [19], where the amino acid at posi- 
non 41 ,s a D instead of a S. Information in the Swiss- 
Proi database on phosphorylation has been disregarded 
because a was known from earlier studies (J E Cells 
unpublished results) thai the spots in question' corre^ 
sponded to the unphosphorylated forms of the peptides 
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different substituents on the a-carbon were taken into 
th?rH • of p/ values were mad? . " 

the aid of the IPG-maker program [20]. 

2.7 pA' values used for p/ calculations 

For the carboxyl terminal group and internal eluiamvi 
and aspanyl residues the same pA' values were used as in 
t-OT C-termmal glutamyl and aspartvl residues sep- 
arate pA values were derived with the aid of the Taft 
equations [9. 21]. The pA' values of histidvl croups were 
calculated from the p/ values of human car'bonic anhv- 
drase I as m [91 For A-terminal glycine a pA" value of 
7o0 was used. The pA' shift caused by a substiiuent on 
the c-carbon was assumed to be identical with the pA' 
shift the substituem caused for the amino group in the 
ammo acid /..^ 2.28 pH units were subtracted fmm the 

fn -A,'*"^ ^"^i"" ^'^ids eiven 

m 122. 23]. The approximate pA' value of 9 for the cvs- 
tenyl group was taken from [24]. For tvrosvl and arcinvl 
groups we used the pA' values for the ammo acids" (22 
.3]. For lysyl groups the effect of high urea concentra- 
tion on ammo groups was taken into account and 0 > dH 
units were subtracted from the ammo acid pA" v'alue 
These last three pA' values are far from the pH range 
under stiidy and the results found would have been the 
same if lysyl and arginyl groups were assumed to be 
fully ionized while the ionization of tvrosv! groups were 
neglected. A complete list of the pA' values used is uiven 
in Table 1. 



Table I. PA- Values used for .he .on.zable groups ,n pcp.idcs 
9-8 M urea . 2S''C 

ionizjble 
group 



C-ierminal 
V-ierminal 
Alj 
Mci 
Ser 
Pro 
Thr 
\-M 
Glu 
internal 
.Asp 
Glu 
His 
Cvs 
T.vr 

. 

-Arg 

C-iermmal side chain groups 
Asp 
Glu 



' Oil 

ft 

S..^ft 

r-.s; 

" 44 
"70 

4 ri> 

4 4 .< 
98 

ij 

10 
10 
12 

■l.S.'i 
4.75 



2.6 Calculalion of p/ values 

For the p/ calculations it was assumed that the same pA' 
va ue could be used for an amino acid residue in all 
polypeptides and in all positions in the peptide except 
lor or C-terminally placed amino acids. For the pA' 
values 01 the A-terminal amino groups the effect of the 



2.8 Statistical analysis 

Statistical comparisons of the experimental and calcu- 
lated p/ values were done on an Apple Macintosh Ilsi 
using the statistical package Staiisiica/Mac. release 3 Ob 
(from StatSoft Inc.. Tulsa. Oklahoma). Calculated and 
experimental p/ values were compared bv the /-test for 
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correlated samples fpaired Mest). The normality of p/ 
differences was estimated graphically by probability 
plots. The variances of the data presented here and the 
similar data on plasma and liver proteins in [9] were 
compared by the F-tesi. 

3 Results and discussion 

3.1 Identification of polypeptides and p/ determinations 

The 2-D gel maps of [■•SJmethionine-labeled proteins 
from noncultured. unfraciionated normal human kerati- 



nocytes. focused with the nonlinear, wide-rance IPG ^ 
C.A-IEF pH gradients in the first dim^nslon.'Ir^ snJu - 
m Figs. 1 and 2. respectively. The IPG extends to hicr.-- 
pH values but otherwise the two patterns are \er. 
ilar and most of the spots in the IPG pattern >:ar. 
directly related to the corresponding spots m i.h.- 
CA-IEF gel. To obtain comparable patterns ii was impvT- 
lant to keep the focusing temperature as similar .i> 
possible. Compared to other studies [1— ». 9. 10. 12-1-;:. 
we increased the urea conceniraiion in the focusing eel 
to 9.8 v because keratins streaked badly in the focusmg 
dimension when 8 m urea was used, presumably due to 
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Figure / :-D eel proiein map of p-S)methionine-labeled proieins from noncuUured. unfraciionated normal human keraiinocvies focused with 
int nonlinear. w,de-range IPG in the firsi dimension. The position of the JI proteins analyzed m this studv 
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aggregates of acidic and basic keratins. An m.-r^ase m 

'° ^ °^ '"^^^ eliminated these 
streaks: apart from this effect, no other maior chanees n 
the focusing positions were obsened. In Fie 1 we'have 
mdicated the positions of 41 known proteins from ^ 
likeTv .nm"'""°'-'' '"^'^ ''<^ mos! 

was made because these proteins are easv to ideniifv 

.nv^. ^"'^ °^ "^^'ifin (spot ^) 

mvoiucnn (spot 4) and keratin 14 (spot 15). which are all 
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i„^,ni ' Pr°'e'ns are listed together with the.r 

D?^e n TJ''"'''"! keratinoc^•te 
protein database and pi values determined in :-D "eel 

J^menslon"''"* ^''^ narrow-ranee IPGs m the firsi 
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3.2 Comparison between the determined and calculated 
p/ values for human keraiinoc>ie proteins 

Thirty six of the 41 proteins listed in Table 2 are found 
in the Swiss-Prot database. Conirar>- to the plasma and 
■ver proteins used in [9]. the p/ calcuations on the pro- 

enec^n'rH P^^b'^f^s that 

renecied the way m which ihey were characterized The 
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proteins used by Bjellqvist et a/. [9] were eithe- v^^ 
were identified by A-termmal sequencing and. thereto" 

h ! c, f I proteins used in 

his study have all been characterized bv internal 
sequencing [7] and it is known that .V-terminal acetvla- 
tion occurs with high frequency m eukarAotes. 
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According to Brown and Robert [25]. proteins with acety- 
laied A'-ienninals correspond in weight to approximately 
800/0 of the soluble protein in ascites cells. Based on 
results from A'-terminal sequencing, at least 40% of the 
spots in the human liver protein 2-D gel map appear to 
be blocked [3). The corresponding number, derived from 
107 spots in the 2-D gel map of human T-lymphocvte 
proteins, falls between 60 and 65% (J. Strahier. persona! 
communication). Information concerning A-terminal 
blockage is not normally available, and in the Swiss-Prot 
database only 6 of the 36 keratinocyie proteins are speci- 
fied as A'-terminally blocked. We have, within the present 
material, defined 18 proteins for which the A'-ierminals 
are ver>- likely to be correctly described. Six of these pro- 
teins are listed in the Swiss-Proi database as .V-termi- 
nally blocked, four represent proteins which appear in 
the human liver 2-D gel map and have been .V-termi- 
nally sequenced as liver proteins [3] and the remaining 
eight have A'-terminal groups other than .M. S and .A., i.e. 
V-terminals for which .V-acetylation is uncommon [261. 
in Figs. 4A. B. C and D p/ values calculated from Swiss 
Prot database mformaiion are plotted agamst the experi- 



mentally determined p/ values for all the keraino -. - 
proteins listed in Table 2 and for the IS selectee 
terns, as well as for the plasma and liver proiem^^j • 
from [9J valid for ICO" 

The calculations show that without knowledge of the 
status of the .V-terminal group, precise predictions of p/ 
values for eukar>-otic proteins cannot be achieved based 
on the information available in Swiss-Proi and similar 
databases. However, for proteins where the .V-ierminal 
status is known, we find good correlation between pre- 
dicted and experimental pi values. When the variance of 
the p/ discrepancies and the variance of calculated 
charges at the experimental p/ values derived from the 
present data set are compared with the correspondmi: 



There arc lour plois. (A) ihe J^e polyperiidcs trom normjl hunun 
keratinocvies ino corrections i. iBi ihe .-o pol> peptides irom Fig. 4 \ 
where pi values have been recalculated I'or 12 polypepuUcs unh M. 
S and .A as V-terminallv assumed blocked, based on ..alcuUtcd 
charge. (C) the 18 selected polypepiiOcs »iih inloriraiion on the 
\-isrm)n3l conMcuraiion. and iDi plasma and liver proicin> 



^ocues ,^0 cir'un^^^^^^^ T r'"" ""'VP'^'"^" '^o"' norn,al huntan kera.i- 

assumL K? J PO'VPeptides from Fig. 4A (including the IS marker polypeptides, .here p/ values have been recalculaied 

m ir'n V tTrml ° W ^ P-'-" B23 ,s indicated with an arrow ,C, 18 polypeptides w.th .n.or- 

maiion on .\-ierminal coniiguraiion and (Dl plasma and iiver proteins. 
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values derived from the data on plasma and liver pro- 
teins in (9] (Table 3). the present data are found to result 
in larger variances for the values of both pi discrepancies 
and calculated charge at the experimental p/ value when 
no information on posttranslaiional modification is 
taken into consideration. Correction for possible .V-acety- 
laiion of 12 polypeptides with M. S and A as .V-termmal 
results in a smaller variance of p/ discrepancies, al- 
though not significantly different from values derived 
from [9], whereas the variance of the calculated charge at 
the experimental p/ value is significantly higher. For the 
18 selected proteins the variance for the p/ discrepancies 
IS significantly smaller than for the data in [9]; however, 
the corresponding value for calculated charge at the 
experimental p/ value does not improve to the same 
extent. This, we believe, reflects another difference 
between the two sets of proteins used for the calcula- 
tions. Based on spot distributions in 2-D gel maps, the 
set of proteins used here has a moiecular"weight distri- 
bution that is more representative of the patterns ob- 
served in mammali:in cells. In the studv bv Bjellqvisi 
eial. [9] most of the high molecular weight plasma pro- 
teins had to be excluded due to their unknown content 
of sialic acid which made the proteins analyzed in this 
study heavily biased towards low molecular weight pro- 
teins. The bulTer capacity of proteins normally increases 
with the protein's molecular weight, and the average 
bufler capacity of the presently selected proteins wjth 
assumed known /V-iermmals is 18 charge units/pH unit, 
while the corresponding value for the "proteins used in 
[9] IS only 9 charge units/pH unit. High bufler capacitv 
can be expected to improve the agreement between cal- 
culated and experimental p/ values. Inspection of the 
data presented in Table 2 for the polypeptides with 
assumed known .V-ierminals verifies the importance of 
the bulTer capacity. For 8 polypeptides having buffer 
capacities higher than 15 charge units/pH unit, the calcu- 
lations in all cases yielded p/ discrepancies with absolute 
values of less than 0.02 pH units. The largest discre- 
pancy. 0.06 pH units, was observed for annexin II and 
stathmin. proteins which have low butfer capacity: 0.9 
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and 6.6 charge umts/pH unit, respeciiveiv. TTie proba- 
biliiy thai the focusing position of a protem w,th kno« n 
composition will fall within a certain distance from the 
calculated p/ value therefore cannot be predicted bv the 
variance alone. The buffer capacity of the specific proiem 
must be taken into consideration as well. .As indicated 
by the decrease of the variance of calculated charees at 
the experimental p/ value for the selected proteins, the 
observed improvement can not solely be due to the 
higher buffer capacity of the keratinocvie proteins. The 
two studies relate to different experimental conditions. 
Good agreement between experimental and calculated 
p/ values implies that the proteins are deiblded and a 
factor that may contribute to the obser\-ed improvement 
is a more complete defolding of proteins caused bv the 
higher temperature and urea concentration used m this 
study. 

The data indicated that the precision with which p/ 
values can be predicted for polypeptides with hish buffer 
capacity is better than the precision with which experi- 
mental p/ values can be determined. If the pH is defined 
through the pA' values of the immobilized aroups m the 
IPG containing gel. the precision of the e.xpenmeniallv 
calculated data will depend on the pH difference 
between the p/ and the pA' value of the immobilized 
group with the closest pA'. For the present studv this will 
give p/ determinations with a precision varvine in the 
range of - 0.02-0.05 pH units [9). The eood ae'reement 
observed between the calculated and experimental p/ 
values IS due to the fact that errors are mainh svstem- 
atic and. as discussed in [9]. they will largelv be cancelled 
out in the calculations. \ pH scale defined ihroueh the 
presently determined p/ values will not necessarilv 
reflect the variation of the hydrogen ion aciivitv during 
the focusing step in an optimal way. but it stiil allows 
precise predictions of focusing positions for polvpeptides 
with known compositions, including information on 
posttranslational modifications. Calculated net charce at 
the experimentally found isoelectric point defined in this 
scale will serve as a tool to verify that the polvpeptide 
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composition used in the calculation is correct and com- 
plete. Exceptions to this are proteins such as involucrin 
and heat shock protein 90 that have ver>- high buffer 
capacities. Introduction of an extra charge unit into 
these proteins will only result in p/ shifts falling in the 
range of 0.01-0.02 pH units and the effect is that the 
quality of the pH definition - the precision by which pA' 
values used in the calculations are given and the preci- 
sion of experimental p/ values in these cases — will limit 
the possibilities to verify polypeptide compostion based 
on the experimental p/ value. 

Statistical comparison of experimental and calculated p/ 
values was done using the r-test for dependent samples 
and normality of the discrepancies was estimated by 
probability plots. For the 36 proteins, the ;>-level is 
0.0021. indicating that a result like this is unlikely lo 
be a chance effect and must be assumed to represent a 
real difference. After correction for the most likcl,\ 
A'-terminal configuration, the ;?-level is 0.043 and cannot 
be accepted as representing the same population since 
the p-level is less than 0.05 — the traditional ;>-limit of 
statistical significance. For the 18 proteins with a known 
or very likely A'-ierminal configuration the /-test gave a 
p-level of 0.49. which verifies that the experimental and 
calculated p/ values are not significantly different. 

Besides showing that p/ values for denatured proteins 
with known compositions can be calculated with a high 
degree of precision from average pA' values, the results 
also provide strong support for the notion that 
A'-terminal blockage heavily depends on the nature of 
the A'-ierminal groups [26]. The results seem to indicate 
that with A-terminals other than M. S and .\. on\\ a few 
proteins have blocked A'-ierminals (1 out of 10 proteins 
in the present study), while it can be inferred from the 
data presented in Table 2 that a majority of the proteins 
with M. S and A as A-ierminal are blocked, .\fter correc- 
tion for the effect of suspected A-terminal blockage 
there is only one protein (nucleolar protein B23) out of 
the 36 used in this study, which, in spite of a high buffer 
capacity, has a marked difference of 0.11 pH units 
between predicted and determined p/ values (Fig. 48 1; 
this corresponds to 3 charge units due to the high buffer 
capaciiv of this protein. This discrepancy in p/ prediction 
and calculation of net charge at the p/ is probably not 
due to deficiencies in the database inl-ormaiion but 
instead rellects a shortcoming of the model used for p/ 
calculations. Nucleolar protein 823 contains a domain 
extremely rich in aspartic and glutamic acid residues 
(Table 4). in which 26 out of 28 amino acid residues 
from position 161 to 188 are either a D or an E. .A calcu- 
lation based on the use of average pA' values unin- 
fluenced by the charged neighboring amino acid resi- 
dues cannot be expected to correctly describe the p/ 
value with almost half of the acidic groups packed 



Table -1. Amino acid .sequence of nucleolar phosphoprolein B23 



together into a highly negatively charged recion. T- v 
limitation caused by calculations based on jverV.^: -a 
values does not severely limit the usefulness o:' 
approach since a search through Swiss-Prot shou ,'; 
this type of D/E-rich motif is uncommon, and the e\i> 
tence of a highly charged region is immediately arparcr:; 
upon inspection of the amino acid sequence. 

The quality of the information available in databases 
especially concerning posttranslational modifications, is 
a major problem when the data is to be used for p/ pre- 
dictions. The /?-level of 0.043 found for all 36 proteins 
after correction for .V-acetylation. shows that this prob- 
lem is not only limited to .\-ierminal blockaiic and the 
very good agreement found ibr the eighteen puivper- 
tides, with assumingly correctly described .\-terminal 
(Fig. 4C). must be regarded as an exception from this 
point of view, A'-Terminal blockage is generally the main 
problem in relation to p/ predictions for eukaryoiic pro- 
teins. Of the 36 keratinocyte proteins analyzed. 18-20 
are suspected to be A-terminally blocked (6 proteins blo- 
cked according to Swiss-Prot. 12 proiems with M, S or .X 
as A-terminal and assumingly blocked based on the cal- 
culated charge, and two proteins, involucrin and 
nucleolar protein B23. with .M as .V-terminal for which 
the data does not allow any conclusion). This is in rea- 
sonable agreement with the conclusions based on the 
A-terminal sequencing data derived in connection with 
2-D gel electrophoresis. .V-ierminal blockage can be sus- 
pected for 17-19 of the 26 proteins with M. S or .\ as 
.V-terminal. while only 1 in 10 proteins with other 
A-terminal groups are blocked. The information that the 
frequency of .V-ierminal blockage is strongly related lo 
the nature of the .V-ierminal croup will be of some help 
in connection with p/ predictions based on database 
information. However, without information from other 
sources, an uncertainly will always remain as lo whether 
the A-terminal charge should be included in the p/ calcu- 
lation. 



•4 Concluding remarks 

The data presented here lays the foundation for com- 
paring 2-D gel protein maps of different cell types gener- 
ated with nonlinear, wide-range IPGs in the first dimen- 
sion. The locusinc positions of 41 polypeptides common 
to most human cell types have been described in a pH 
scale that allows focusing positions to be predicted with 
a high degree of accuracy, provided that the composition 
of the polypeptides are known and that information on 
posttranslational modifications are available. For poly- 
peptides with a very high buffer capacity, the limiting 
factor IS the precision wiih which experimental pH 
values can be determined rather than the precision of 
the calculations. Possible deficiencies in the pH scale 
description of the variation of the hydrogen ion activity 
has. at least at the present state, no consequences for iis 
practical use. The major limitation in connection with 
predictions of focusing positions from polypeptide com- 
positions is the quality ot existing data on protein com- 
positions, especially concerning posttranslational modifi- 
cations. .Ammo acid sequences have been reasonably 
easy to obtain, while posttranslational modifications 
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have been difiicult and work-iniensive lo deiermine. 
Recent developments in the field of mass spectrometn- 
are fast changing this situation and within the next vears 
we can expect a surge in reliable data in this area. While 
awaiting this development, verification of correctness 
and completeness of available information on polypep- 
tide composition can be provided by experimental p/ 
values in a pH scale based on the p/ values determined 
in this study. So far. our data cover the pH range below 
pH » 1.5. The basic pH range covered by NEPHGE as 
first dimension will be covered in forthcoming work. 
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^eMx«.on of hu.^ f„ 

• . u *i«.iTopnoreiu JQj 

Nonenzjinatic extraction of cells from clinical tumor 
matenal for analysis of gene expression bv t.o- 
dimensionaJ polyaciriamide gel electrophoresis 

J^Se„r„Tetec:^^^^^^^^ -'i^-n« cells for 

ussue to be superior comDare^ to ,;«L^^ found all methods using fresh 
indicate that nonenn4a^riV,H„Hc r ' """^ '""'^s 
fine needle Lp°rS™.Jl^a^^^^^^ Pr^parauon of tutnor cells, including 
using enz^m^tTc erac cells l?""^"^. have advantages over method! 

to reduce loss of " r^oLul ote.'^'Ip^^^^^^ ^^""^^ ^^P"): 

separating viable and nonviable cells bv pS '.r^^^^^ necessity of 

ihese techniques high-aualiiv 2 DF ^, gradient centrifugal ion. Using 

lung and brLi In the resuUin'. L^n''' Z"' ''"'"''^ ^^'^ °^ 
non-muscle iropomvosin. ,nJ f P°'>-P.=P"de patterns, heat shock proteins, 
dude tha noS°S™a,tT^^^^ identified. Wc con- 

•mprovcs the possSes thaTthLe tecTn "'1^^°"^ ^'""^ '""'"^ ''""^ 

nosis. ""^^ ^^"^ techniques may be useful in clinical diag- 



1 Introduction 



Tumors may develop by a number of different mechan- 
sms m any g.ven cell type, .^i the time of diagnosS 
tumors will have progressed along different pathwavs o 
various stages of malignancy. To provide a bSs f^ -ini?- 
ndual therapy ,1 is of .mponance to examine specific 
properties of the tumor cell population ,n each pa ,em 
A large number of different markers have been 
hI'^k'^h" '0 '""ease the diagnostic accuracv. It is 

i'»vel> thai a combination of serveral markers is needed 
m the future .n order to rencct different properties of 
he tumor. One important method for the resolution of a 
large number of potential markers is tuo-dimens.onal 
electrophoresis f2-DE). Extensive efTons are bemg Tadj 
m idenufymg various polypeptides separated bv ^DE 

'ides' .s'^iS'^n' T "P^"^'"" or these polvpep 
ides IS a/fected by the response to cellular transforma- 
on and various culture conditions [1.2], 1, wou^d be of 
value to transfer this mformanon to 2-DE separations of 
polypeptides from tumor tissue samples. HoweJ r. one 
prerequisite ,s thai the quality of the 2-DE gels from 
umor samples ,s comparable m qualiiv with 2 DE eeh 
from samples of cultured cells. 

fh^^^" commonly used for various bio- 

.^zed by 2-D polyacrylamide gel electrophoresis (PAGE) 

oAerum'^'n'/ '"'""^ """"^^'^ -n'arlt.nauon 
VrZT;J.rt'°"''""''' P^°'=-n^- Such nontu- 

mor cell-related variations represent serious problems in 
the interpretation and inter-paticnt comparison of 2-DE 

Correspondence: Dr Bo Fran^n. Div.s.on of Tumor Patnolo«v 

Sl"w^i:r's^j;l"- "'""-^^ 

>P JO. .Non.de. P-40: PBS. phosphaie buttered inline PC\A oroHfV 
" • ' '""--tc: we. 



fumoT'mlrii',^^ °^ ""^ P^'^P^'d f^n^ fresh 

ofTum^ r-^' ^"r enzymatic extraction 

01 tumor cells {4. 3] or after culturing tumor fraements in 
medium containing radioactive amino acids (6? ^ie e 
procedures may. however, lead to alterations in the «ne 
expression/polypepiide patterns. We are onl aware of 
one study where nonenzymatic extraction SfcelTs from 
fresh turnor tissue (prostate cancer) was used to preDare 

erraa^,'°;nd° ^^""^ "^"""^^ 
extraction and vanous nonenzymatic preparation lech- 

mques including fine needle aspiration' fo? th prepara- 
tion of cells from fresh tumor tissues. We describe 
nonenzymatic extraction procedures that are rapid lead 
to h,gh-qaality 2-DE patterns, and that allevSte tht 
necessity to purify tumor cell populations from deaS 

2 Materials and methods 

2.1 Cell cultures and samples used for spot 
identification 

A rat embryonal fibroblast cell line. WT2 (a kind aift 
from Dr. I. Carrels and Dr. S. Pattersson) was t^sed fo 
he Identification of a number of heat shock and struC 
wng ""T'l" '^'P'^'^ '""8 fibr^bli^^ . 

B l and Mrr 7""'^'"" carcinoma cells. MDa: 

231 and MCF-7 were purchased from ATCC and grown 
as recommended. Polypeptides prepared from f Tu- 
kemia type pre-B-ALL were separated by ?-DE T^e 

versit> of Michigan. Ann Arbor, USA). 



Tumor tissues samples 



In this study. 2.DE maps from seven tumors were used 
TJT^o'nT'' .llustrations: two adenocarcinoma of 
the lung (LA and LB. mucinous, both cases interme- 

onSe'IuS: r^.l''^"'"''--")- one sqamous caSra 
01 the lung (LS). one carcmoid-like breast cancer (BO 

S,? '^'?»^'>' din-ereniiated) of 

the thyroid (TA). one highly differentiated hyperneph- 

"I -:...is.i5/^j/ioin.inj5 ss on. j.i/o 



10^6 



roma. a lumor of the kidney (KHl and finallv one case 
of poorly difTereniiaied corpus carcinoma (CP). 

13 Preparation of cultured cells 

The cell monolayers were washed twice in phosphate 
buffered saline (PBS) and then scraped off in ice-cold 
PBS mduding protease inhibitors (PIH). phenylmethvl- 
sulfonyl nuoride (PMSF) mM and 0.83 mM benzami- 
dme pelleted at 660 X ^. 3 min (+4«C) and washed one 
lime before final centrifugaiion at 2700 X g. S min. The 
wet weight of the cell pellet was recorded and the cells 
were stored at -80"C until further processing. 

2.4 Preparation of tumor tissue samples 
2.4.1 General remarks 

.Macroscopically representative and non-necroiic tumor 
tissues were selected within 20 min after resection 
Parallel samples were routinely prepared for cytology 
The samples were processed as rapidlv as possible on ice 
or ai +4'C and in the presence of PIH. Cells were 
siamed with DifTQuick (Baxter) and usuallv examined at 
three different occasions during the preparation proce- 
dure: (i^ cytology sample, (ii) extracted cells and (iii) 
cells after percoll gradient centrifugaiion. 

2.4.2 Specimen acquisition 

The strategy of sample preparation is shown in Fig 1 
Tumor tissue cell samples were usuallv obtained bv fine 
needle aspiration (NA) using a 0.7 mm needle' The 
syrmge was filled with 1-2 mL of ice-cold culture med- 
lum/PlH. We found that if a tumor appeared to be ver>- 
fibrous it is diflicult to extract enough cells for 2-DE 
analysis. In these cases, two alternative techniques were 
examined, (i) The tumor was cut in the middle and the 
fresh surface scraped (SC) by a scalpel. The cell-rich 
material was then transferred to ice-cold culture 
medium fL15 with 5% fetal calf serum)/PIH. (ii) A pan 
of the tumor sample was placed in culture medium on 
ice for further processing at the laboraton- in the fol- 
lowing way: the material was cut into ver)- small frag- 
ments on a pre-cooled dissection plate and transferred 
to a small glass chamber with a 0.7 mm metal net 5 mm 
above the bottom of the chamber. Medium /PIH was 
added to cover the sample (8 mL) which was gentiv 
squeezed (SQ) towards the net in order to release and 
wash out cells. NA and SC were also compared with an 
enzymatic extraction (EE) procedure described previ- 
ously [5J: Bneny. thin slices of tissue were incubated 
with collagenase (1 mg/mL) and elastase (2 mg/mL) in 
medium for i h at 37°C. Extracted cells from ever\' 
sample were then subjected to percoll gradient centrifu- 
gaiion (Section 3.2.3). 



2.43 Separation of cells by Percoll gradient 
centrifugaiion 

The cell suspension was filtered through two nvlon mesh 
inters. (I) 250 um and (ii) 100 nm and then centrifuged 



at 660 X .1- for 5 min. The cell pellet wjs resusncnd ■ 
carefully m medium, usini: j svnnee and loaded onu"' 
iwo-stcp discontinuous Percoll/PBS eradiem \i 4 " 
(density = l.OJ g/mLi and .<4.-'''.. idensiiv = i O' Zmi- 
and ceninfueed at 1000 X for 15 mm. In this sVsi^n- 
dead cells stay on the top. vuble cells sediment to the 
interphase and en throcytes sediment to the bottom. Th; 
viability of cells in the top traction and interphase was 
checked by the trypan blue exclusion test. The inier- 
phase cell layer (> QO". vubiliiy) was collected and 
washed one time in a brec volume PBS/PIH (centri- 
fuged at 800 X V for mini. Fmallv. ihc cells were resu- 
pended in 1.4 mL PBS and pelleted ai 2700 X v for < 
mm. The wet weighi iWWi was recorded and the pellei 
was then stored at -SO C. 

2.4.4 Final preparation of cdK for 2-D PAGE analynis 

From this point, cultured cell samples were treated 
in the same way as tumor cell .>iamples; Each cell pellet 
was thawed on icc and resuspended in \.S9 uL mQ water 
per mg WW (= i.80 X WW, uL. The suspcn.sion uas 
frozen and thawed 4-> x lo break the cells (7| \ 
volume of (0.089 X WW, uL 10".. sodium dodccvl 
sulfate (SDS). includins: .v..^"„ mcrcapioethanol. was 
mixed wiih the sample and iiicubaied min on ice with 
(0.329 X WW) nL of a solution of DNa.sc I (0 144 
mg/mL 20 msi T^i^-IICI with 2 niM CACl. X ■>11.0 pH 
8.8) and RNase A (0.0718 mu/mL Tri.s) |S.91. The sanipic 
was frozen and lyophilized. Sample bullcr flO] including 
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Figure I. Experimenul fin* chari showmt mam sicps of \ht prcpam- 
iion procedures The jbbrevui.onv used for rioncnzvmalic extraction 
procedures are FZ; frozen sample preparaiion; NA. needle aspir*- 
lion; SC. scraped, and SQ. squeezed sample Exiracied cells are then 
loaded as a suspension (top volume of each lube) onto either 
1.07 g/mL Percoll (left,, or a disconi.nuous Percoll gradient from the 
nonenzymauc extraction i middle,, or from enzymatic extraction 
(nfhtl. Cellular top- and mierphasi; fractions are then used for 2-DE 
For details see Section 2. 



eiearofitatnu I99J. H. 1045- I05J 

(NP-40). and 3-p.cholamido propvO-dimethx^a^L ? 
1-propanc sulfonate rCHAPS: 2rlr,'"4-?ddcTSr° - 
fuJIy. m.xcd for 2.5 h and cenmfuged (or lS miT!; 
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2.4 J Preparation of frozen tumor tissue 

The technique has been described previously f5.I2]. 
Brien.v. the sample is moaned frozen to a fine powder, 
homogenized, lyophilized and solubilized in sample 
bufTer. 



2.4.6 Control of representatirity 

The tumors were examined routinely by experienced 
pathologists and smears or imprinis from the samples 
were also assessed for cyiomeiric DNA content by 
microspectrophoiomeiry. 

2.5 2-D PAGE 

2-D PAGE was performed as described [8.10} except for 
the following details. The glass tubes for lEF. 1.2 X 200 
mm. contained 2.0'ki Resolyie. pH 4—8 (BDH) and were 
cast to a height of 180 mm. A stock solution of acryl- 
amide (Serva) and A.A"-meihylenebisacrylamide (16.7:1 
for lEF and 37.5:1 for the second dimension) was deio- 
nized by mixing with 5% w/v Duolite MB 5313 mixed- 
resin ion exchanger (BDH) for 30 min. filtered (with a 
0.22 um nitrocellulose filter) and stored at — 70''C. 
A'.A--jMethylenebisacr>lamide. A'.A'.A'.N'-ietramethyieth- 
ylenediamine (TEMEDj and ammonium persulfaie were 
purchased from Bio-Rad. lEF tubes were prefocused at 
200 V in 60 min. To each tube a sample corresponding to 
20—40 \ig protein was applied and focused for 14.5 h at 
800 V and finally 1.0 h at 1000 V using a Protean II cell 
(Bio-Rad) and Model 1000/500 Power Supply (Bio-Rad). 
The tube gels were finally extruded into 1.25 mL equili- 
bration bufTer. coniaming 60 mM Tris. pH 6.8 (2% SDS. 
100 mM dithiothreitol and 10% glycerol), frozen on dr\' 
ice and stored at -70°C. The second dimension (1.0 X 
180 X 90 mm) of the acn,-iamide concentration was 10% 



Jn^ t« contained 3"t> mM Tns. pH 8.S. jnd v ' 
SDS. lEF gels were applied on top of the sbb ue! i-a... • 
with Q.>% agarose containins electrophoresis runnm- 
bufler (60 mM Tris-base. 0.2 .m givcine and 0.1-. SDS'~ 
and electrophoresed with lO-lfmA per gel (consum 
current) at --lO'C. Si.\ gels were run together in j Pro- 
tean II xi 2-D .Vlulii-Cell (Bio-Rad). Protems were visual- 
ized by silver staining and photocraphed with the acidic 
side to the left 113.14]. 

2.6 Identification of polypeptides 

Vimeniin and vimentin-derived polypeptides were identi- 
fied by extraction of an MD.A-23I cell Ivsaie with 0.^ m 
KCl/0.5% NP-40 [15]. Tropomvosins were e.\ciracicd 
from MDa-231 and WI38 cell lysates (16J. and cvtokera- 
tins were extracted from MD.V231 and MCF-" cell 
l.vsates (17). The patterns were compared with published 
maps (19-21). Proliferating cell nuclear antigen (PCN.A) 
was identified by immunoblotimg (PC 10 m.\B. D;iko- 
pati) using a semidr> system (Muliiphor 11 Nova Blot. 
Pharmacia-LKB Biotechnology .\B) and enhanced che- 
molummescence (ECL) detection (.^mcrsham). 

3 Results 

3.1 2-DE of samples prepared from normal and 
tumorigenic cultured cells 

The object of this study was to develop methods for pre- 
paration of 2-DE maps from human tumor tissue which 
have the same high resolution as those obtained from 
cultured cells. Shown in Fig. 2 are high resolution 2-DE 
gels prepared from cultured cells and one leukemia: 
SV40 transformed cmbr>onal rat fibroblasts WT2 (Fig. 
2a): human MDA-231 breast carcinoma cells (Fig. 2b): 
human WI38 fibroblasts (Fig. 20 and human pre B-ALL 
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cells (Fig. 2d). Polypeptides were identified ihroueh a 
laboraion exchange of cell samples/2-DE maps "and 
through 2-DE analysis of purified proteins (Table I). 

3.2 Preparation of samples from solid tumors 
3.2.1 Fresh versus frozen tissue 

An adenocarcinoma of the lung (LA) was prepared for 
2-DE by conventional methods using frozen material 
(Fig. 3a). There are several possibilities for the poor reso- 
lution using frozen tissue, including the presence of high 
molecular weight protein aggregates. Filtering extracts 
through 0.1 urn filters fDurapore. Millipore) resulted in 
a slightly improved resolution (not shown). When fresh 
tumor tissue from tumor LA was used for sample prepa- 
ration, using fine needle aspiration to collect the cells 
the resolution was considerably improved (Fig. 3b) The 
use of fresh tissue resulted in a general increase in reso- 
lution, which was most pronounced in the 50-100 kDa 
molecular mass range. A number of differences in the 
protein profiles of the gels in Figs. 3a and 3b can be ob- 
served, some of which are indicated in the fieures The 
decrease m serum albumin in Fig. 3b is likelv to result 
from loss of serum proteins occurring when cells were 
pelleted after aspiration. Other difTerences. such as the 
decreased level of transformation-sensitive tropomvosins 
(T.M1-T.\i:). may result from enrichment of tumor cells 
m the sample of Fig. 3b. Fine needle aspiration, a well- 
established technique in cytology, extracts mainlv tumor 
cells because of decreased intercellular adhesiveness of 
neoplastic cells as compared to normal tissue. Micros- 
copic examination of DifT-Quick-stained extracted cells 
from case LA revealed almost \00% tumor cells 
whereas the whole tissue extract contained approximate- 
ly 60"ii mmor cells. 
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EFl 
CT 
hsp60 
hjp73 
hspSO 
h$p90 
hsplOO 
IF* 
kS 

LamB 
Lipl 
Lip2 
Lipi 
Mill 
.Mit2 
I^iLJ 
MRP 
pcna 
PLC 
RO 
SA 
aT 
bT 
tml 
tm2 
tmJ 
tm^ 
im5 
TPI 
V 

Vidl 
Vid2 
Vid3 
Vid4 
V,n 



Acuns 

alpha-Kaxnin 
Protein B2j /N'umatnn 
Elongation factor 2 
Elongation factor I 6 
Gluuthione-S-irantpheratr ijii 
Heai shock protein 60 
Heal shock protein 73 
Heai shock protein 80. GRPTg. BIP 
Heai shock protein 90 
Heat shock protein 100. EndopUsmin 
Iniennediar>' filameni associaied 
Cviokeraun 8 
Lamm B 
Lipoconin I 
Lipocomn II 
Lipoconin V 
Miicon 1/6 - Fl .ATPase 
Mitcon 2 
Miicon 3 

Mucine Related Polypeptides 
Ploliferating cell nuclear aniigen 
Phospholipase C (1) 
RO/SS-A antigen 
Serum Albumin 
a/^Ao-Tubulin 
6*j>i<i-Tubulin 

Non-muscle tropomyosin isoform 1 
Non-muscle tropomyosin isoferm 2 
Non-muscle tropomyosin isoierm i 
Non-muscle tropomyosin isoiorm 4 
Non-muscle iropomyosin isororm i 
Triose phosphaie isomerase 
Vimeniin 

V'imeniin derived protein 
Vimentin derived protein 
Vimeniin derived protein 
Vimeniin denved proiein 
Vinculin 



a. homologous position with respect to other 

b. purified proieinis) 

c. immunobloiiing 
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a 
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b and j 
b and a 
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b and a 
b and a 
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b and a 
b and a 
b and a 
b and a 
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122 Comparison of difTerent methods for preparins 
cells from fresh tumor tissue 

Samples were prepared from breasi and lung carcinomas 
using either an enzymatic ireatmeni with "coilaeenase/ 
elastase or using nonenzymatic preparations (Fig. 4). a 
number of differences in the protein profiles were ob- 
ser^-ed m the resulting 2-DE gels, some of which are 
mdicaied in Figs. 4a and b. These differences include 
both mcreases and decreases in spot intensity. These dif- 
ferences may result from degradation of high molecular 
weight polypeptides during enzymatic treatment, in- 
creased solubilization of polypeptides, or mav have other 
causes. For many lumors. it was only possible to obtain 
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%mi\\ amounts of matsrul smce thf^ w.vv r^s-^rvcJ m- 
other examinations. In these cases, sjmples could b « nr - 

-scraping. Figure aa shows a 2-DE «l prepared iron- 
squamous lung carcinoma (LS) cells collected b^ needi-' 
aspiration and Fig. 5b shows a sel prepared from the 
same tumor by scraping. In this case, a number of difTer- 
ences were recorded between the two procedures, some 
of which are arrowed m Fig. 5. Samples obtained from 
other tumors (breast and lunci ueneralh showed fewer 
differences between these two methods of cell samplinc 
(not shown). These data sho« thai difTerent noncnzO- 
matic extraction procedures may viold difTerent polvpep- 
iide patterns. However, the number of spots uith a hrue 
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r,wrcy 2-DEjnjlvs,sof3Cjseonungcan«r(LS. Comparison of 2-DE tcl Qu .l„^ ,ndd , 
-Ai asp.ra.ed .needle usp.ranon. and (B. scraped preparanons frorn fresh "2^^^ '^^^ 
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Figut, b. 2-DE analysis of three other types of tumors. (A> hv 
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nonenrymaiic preparation technique. Arrowheads and circles indicate 



ypemephroma. (Bi an adenomj of the thyroid and (C. corpus cj 



some cyiusolic polypeptides. 



rpu'. cancer, usinj; ihe 
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difTerence in iniensiiy were lower than when a nonenzv- 
malic preparation was compared with an ennmatic ore- 
paration. 

2-DE maps of satisfacior\ quality were prepared bv a 
third procedure. Cells were released from small piecei of 
tumor by squeezing (see Section 2). Some examples of 
this are shown in Fig. 6 where 2-DE maps derived from 
a case of hypernephroma. KH (Fig. 6a). a case of thyroid 
tumor. TA (Fig. 6b) and a case of corpus cancer, CP (Fig 
6c ) can be seen. We conclude thai nonenzvm'atic tech- 
niques are useful for 2-DE analysis of a number of dif- 
ferent tumors. The quality of the resulting gets is com- 



hmrauon of .„„un ,„„on for «u.>.. s> ;.d c.c.-.,cr«o..,. tO< 

p^ble to that obuined using ciiltured ceils .comp -^ 
the gels in Fig. 2 wiih those in Fig. 4. 6 and ") \Vhr ."n' 

depend on the tumor material. For example, ven sm^u 
tumors are preferably e.xtracted by squeezing: on the 
other hand, breast cancers (which are often fibrous) 
yield saiisfactor> samples using scraping. 

3.2J Purification of cells on percoll sradients 

We considered the possible advantaee of separatine 
viable cells from dead cells. er>throcvies. and debn's 
using discontinuous Percoll eradienis. Cells collected 
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90<^ as judged by trv-pan blue exclusion test. However, it 
as found ihai the yield of viable cells decreased drama- 

r '!r""'°" immediatelv pro- 

cessed. To study the effect oflysis of cells during the pre- 
paration procedure. 2-DE maps were prepared from 
nonenzymatically extracted cells of case LB collected 
trom the top fraction (nonviable. Fig. 7a) and interphase 
fracfon (Viable. Fig. 7b). Tliese 2-DE maps wSe 
compared with corresponding fractions (nonviable Fie 
ok^lrT-/'^- °r =n2ymaiicaily extracted cellf: 
rh^f disadvantage oi the enzymatic technique was 
that uhen loss of cell viability occurred during prepara- 
tion, a dramatic loss of high molecular weight polvpeo- 
tides was observed (Fig. 7c). This was probably due ,o 
degradation of intracellular proteins. However, nonenzv- 
maiic preparations showed fewer differences between 
viable and nonviable cells: The most pronounced ah"a- 
.on w^ a decrease of a group of mucme related pro- 

nuous Percoll gradient is necessary after enzvmaiic 
extraciion of cells, but can be omitted from the nonTnzv' 
malical tumor sample preparation procedure. 

We ijsed the MD..\-231 cell line to study the elTecis of 
cell lysis and leakage of cytosolic polypeptides du .ne 
sample preparation. Remarkably, after 30 SO. 80 "nd UO 
mm of mcubat.on m PBS/PIH at 0"C. no significant 
changes were obserx-ed in the 2-DE pattern (no, shou^^ 
Although loss OI cell viability may no, result m pro.ein 
degradation when cells arc incubated in the presence of 
proicase inh.buors. loss of cytosolic proteins would be 
expected during pelicing of cells. We monitored the loss 
nl, 'n?H' '^'^"^•'^^^f <LDH) activity into the supcr- 
njtam during mcubaiion in PBS of MDA-23I and MCF- 
7 breast cancer cells at 20 C. In both cases, loss of via- 
b.hiy was paralleled by release of LDH from the cc 

or thL .\1DA-2jI cells were dead (not shown*. 



,hf. n «^'»hrootes. One dear advantage w, h 

higher yield of viable cells, probably due to decreased 
sample preparation time. 

4 Discussion 

ToufVJr^nr. V^^.'^liT "'"P''^ Preparation from 
from .mr, °' ^-^^ ""^P^ ^-O""! derived 

SsmTfro.In '"'T'^ Compared to methods 

techL . .""^'^na'- the resolving power of the 2-DE 
number nf'n^r"'"-!.'^- examination of a large 

nancfe/ ° .h "■P'""'^" '""^^^ of different malig- 

fror^ fro.?i H^-^-^l have used samples 

trom frozen tumors to derive 2-DE maps. We have previ- 
ously described disadvantages encountered us^g frozen 
umor samples including variations in contamfnafing pro 

c ?bedrr'/" ""^P'" "^^ methods de- 

fnmnr? u """^ '"'"'^ °" Preparation of cells from 

sohi t ^^O'led since malignant cells usuallv grow as 
solid masses which are no, stronulv attached to (he 
matrix. Furthermore, we found tha? omitting The enzy- 
ma, ic digestion alleviated the necessity of purifvine 
viable tumor cells on Percoll gradients. This was'in" hirj 
con, as, o cnzymatically trea.ed samples, where loss ?f 
vub.h > leads ,0 loss of high molecular weigh, protein 




mil) 



At Las, .n ihc case of lung cancer, viable and nonviable 
cell, showed small dilTcrenccs in respect to 2-DE maps 
Presumabls. protease inhibitors penetrate cells S 

^^^'T r^ "Penments. we observed 

leak igc 01 cytosolic protein (LDH) from the cells in 
par II c, .0 loss of viability. Apparently, however, on v a 
imnucd decrease of the level of low molecular weight 

con?h 7 ■ 'i^''""^" ^'^^ '^"''''''^ silver staining 

combmcd wMh y.sual inspection. We have found (ha^ 
although some tumors arc well suited for the prepara- 
tion procedure described, others are not. In generS 
eood results were obtained using tumors of the lung.' 
thvTn H '"^ lymphomas. In contrast, cells from 

,h Old adenomas and hypernephroma showed poor via- 

.i Jr u ""^'"^ '° "P"^'^ nonvl- 

able cells trom viable cells, and we can therefore no 
evaluate ,he consequence of the loss of viaSy on 
2-DE patterns, apart from a loss of some low molecular 
weight cytosolic polypeptides. moiccuiar 

Highly differentiated tumors may show lower viability as 
compared w,th poorly differentiated tumors (Dr. Farkas 
from inv'/^'n"^' =°'"'"""'"t'on). A number of samples 
from thyroid tumors were prepared for 2-DE but niost 

" ^^."^ """^ ^= believe that special care 

IS needed during preparation of generally highlv differen- 
tiated tutrior groups. TT,e difference betweeS loss of via- 
bi ity/leakage of LDH of the more differentiated MCF^? 
cells and the less differentiated MDa-231 cells is VnlL 
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, With these obser\-ations (Fig. 8). A number of poiential 
and interesting markers, like tropomyosin isoforms. cno- 
kcraiins and heat shock proteins, appear to be insensi- 
nve to loss of viability during the preparation procedure 
Hon?"'" 1° "^^^ numerous observations of altera^ 

cancerT t\ '^ese polypeptides in breast 

cancers and lung cancers. 

Another problem that ma> occur, irrespective of sample 
pr paration techniques used. ,s admixture of Ivm^So- 
m- f ,h r' "^"s^^ble in smears and it 

mj> therefore be possible to select Ivmphocvte specific 
spots as -internal markers" for the 2-D P^^GE analv^s 
Studies using this approach are m progress Sanrof the 
polypeptides identified are structural rTxible I) Since the 
expression of many of these polypeptides are known ^o 

to deter-mr'rT '"'^ P°»*f ili.v 

nn.n. , ' e.xpress.on simultaneouslv is 

.ons n"fhJ" ''"■''V' °^ altera- 
inoker , ^-^P^"^'"" or intermediate filament proteins 

Si mi A'."'' '° 'i"^'"-^ lumor progres- 

pre sei b^l'." differentfal • 

expressed between normal cells and transformed cells 

d PCnT-t^'T- heat shock pro.em 

anu I LiNA, To ihis end. we have observed alteraiinnc in 
the expression of cy.okera.m 8. hsp 90. and non ZV^e 

nr man , f^^^""" "L. m pre- 
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Large Scale Biology Corporation 



Large Scale Biology Corporation is the leader in the integrated discovery, production 
and application of proteins - the functional units of all biological processes. 

Large Scale Biology Corporation (LSB, Vacaville, CA) and its subsidiary Large Scale 
Proteomics Corp. (LSP, Germantown, MD) are a biotechnology enterprise with the mission of 
accelerating the speed and productivity of the life sciences industry product discovery and 
development programs. Unique among biotechnology companies is LSB's integration of 
technologies to discover, analyze, manufacture and find new applications for proteins - the 
functional units of all biological processes. 

Genomics companies have focused on deciphering genetic information, providing an initial but 
only partial understanding of biological processes. LSB's proprietary protein technologies can 
enable the transformation of genomic information into products such as drug targets 
therapeutics, diagnostics for drug efficacy and toxicity, and traits for agricultural crop's Large 
Scale Biology has gone beyond the "genomics" realm in its business model and developed 
ways to integrate the discovery of gene function with quantitative protein analysis and protein 
manufactunng. This integration of technology platforms favorably positions LSB as a leading 
provider of valuable content to industry leaders in the fields of diagnostics, therapeutics 
vaccines and agribusiness. 

LSB was founded in 1987 with the goal of commercializing its proprietary GENEWARE viral 
vector system - a novel technology for gene expression. Using safe RNA viruses to transiently 
express genes in non-recombinant plants, LSB has positioned itself in the industry to provide 
cost-effective manufacturing and purification of diverse protein and peptide products The 
same technology can be applied to the expression of libraries of foreign genes in an 
automated, high-throughput format to discover the function of genes with unparalleled 
efficiency. The GENEWARE system and associated proprietary technologies form the basis 
tor LSB s functional genomics, biomanufacturing and a variety of proprietary products under 
development. 

From its foundation, LSB understood the need to integrate functional genomic and protein 
manufactunng expertise with quantitative protein analysis and informatics to become a 
world-leader in the protein field. In 1999, LSB acquired a privately held pharmaceutical 
proteomics company ohginally founded in 1985. Large Scale Proteomics Corporation (a wholly 
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owned subsidiary of Large Scale Biology Corporation) is an industry leader in identifying and 
characterizing proteins in all types of biological samples for the discovery and developnient of 
new and more effective therapies, diagnostics, and agricultural products. 

"Proteomics" is the study of the entire complement of proteins expressed in a cell, tissue, or 
organism. Proteomics can significantly improve drug discovery and development because 
most illness is associated with imbalances among, or malfunctions of, proteins. Only a small 
fraction of diseases can be attributed to the presence of a defective gene. Unlike classical 
genomics approaches that discover genes that may relate to a disease, LSP has developed a 
proprietary system called the ProGEx module for directly characterizing proteins associated 
With disease. Using this same technology, LSP can characterize the effects of candidate drugs 
intended to reverse a disease process, and to determine the degree to which this objective is 
achieved free of adverse side effects. 

LSB and LSP have protected their many discoveries though an extensive portfolio of domestic 
and foreign patents and have developed commercial alliances and partnerships to exploit the 
value of their technologies. LSB and LSP scientists and engineers focus on the development 
and application of resources to help clients meet their objectives as well as the development of 
our own propnetary products for subsequent partnering with industry leaders. 

A combined staff of 140 professionals operates from three locations in the United States, with 
a network of collaborators and affiliates throughout the US and Europe. Company 
headquarters, R&D laboratories and its Genomics division are located in Vacaville, Califomia 
about 60 miles northeast of San Francisco. Process development and biomanufacturing take 
place in Owensboro, Kentucky, and LSB's Large Scale Proteomics Corporation subsidiary is 
located in Germantown, Maryland. 

In August, 2000, LSB completed an initial public offering (IPO) of 5 million shares of common 
stock and now trades on the NASDAQ'under the symbol LSBC. 

Leadership - Large Scale Biology Corporation 

Robert L Erwin, Chairman of the Board and Chief Executive Officer, founded LSB'" and has 
served as a director and officer since 1987. Mr. Enwin is the former chairman of the State of 
California Breast Cancer Research Council and currently serves on the University of California 
President's Engineering Advisory Council. He is Chairman of the Supervisory Board of Icon 
Genetics AG. As a co-founder of Sungene Technologies Corp., Mr. Erwin served as Vice 
President of Research and Product Development from 1981 through 1986. He has sen/ed on 
the Biotechnology Industry Advisory Board for Iowa State University. Mr. Erwin received his 
M.S. degree in Genetics from Louisiana State University and is an inventor on several LSB 
patents. 

David R. McGee, Ph.D.,a co-founder of LSB and Senior Vice President and Chief Operating 
Officer, has been an officer since 1987. Prior to joining LSB, Dr. McGee was Vice President of 
Operations at Sungene Technologies Corporation from 1983 to 1987. Dr. McGee received his 
Ph.D. in Genetics from Louisiana State University and sen/ed as a faculty instructor of zoology 
and genetics at Louisiana State University. 

Laurence K. Grill, Ph.D.,a co-founder of LSB and Senior Vice President, Research and 
Development, has served as an officer since 1987. Dr. Grill was the Manager of Plant 
Molecular Biology for Sandoz Crop Protection Corp. from 1984 to 1987 and Senior Research 
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Scientist in the Department of Molecular Biology at Zoecon Research Institute from 1 980 to 
1984. He received his Ph.D. from the University of Califomia at Riverside with an emphasis on 
the molecular basis for viral gene expression in plants. 

R. Barry Holtz, Ph. D., Senior Vice President, Biopharmaceutical Manufacturing, has served 
the company as an officer since 1989 upon the acquisition of Holtz Bio-Engineering, which 
was founded in 1980. Dr. Holtz was a co-founder and Director of Research for MFI. 'lnc, the 
largest manufacturer of microencapsulated nutrients for agriculture and Director of ' 
Fundamental Research at Foremost-McKesson, Inc. Dr. Holtz received his Ph.D. in 
Biochemistry from Pennsylvania State University and served as Assistant Professor in the 
Department of Food Science and Nutrition at Ohio State University. 

Dan/e/ Tuse, Ph.D., has been an officer of LSB since he joined the Company in 1 995 as Vice 
President, Pharmaceutical Development. Dr. Tuse manages the company's pharmaceutical 
design and development programs, including LSB's novel vaccines and immunotherapeutics 
initiatives. Prior to joining LSB, Dr. Tuse was Assistant Director of SRI International's (Menio 
Park, Calif.) Life Sciences Division. In his 17 years at SRI, Dr. Tuse developed extensive R&D 
experience in pharmaceuticals and specialty chemicals, serving an International list of clients 
Dr. Tuse received his Ph.D. in Microbiology (1980, cum laude) with a minor in Toxicology from 
the University of California, Davis. 

John S. Rakitan, a co-founder of LSB, Senior Vice President & General Counsel and 
Secretary, has served as an officer since 1988. Prior to joining LSB, Mr. Rakitan was an 
attorney in private practice. Mr. Rakitan received his J.D. degree from the University of Notre 
Dame. 

Michael D. Centron, Treasurer, has served as Controller since 1988 and was elected as 
Treasurer in 1 991 . Mr. Centron was Audit Supervisor for Varian Associates from June 1 985 
through July 1988, and he also worked for Arthur Young and Co. (currently Ernst & Young). 
Mr. Centron is a certified public accountant and received his M.B.A. degree from the University 
of California at Berkeley. 

Guy della-Cioppa, Ph.D., is an officer of the company and currently serves as Vice President, 
Genomics. Prior to joining the company in 1 989, Dr. della-Cioppa worked for Monsanto 
Company in St. Louis, MO from 1984-1989 and was an NIH Postdoctoral Fellow at the 
Worcester Foundation for Experimental Biology in Shrewsbury, MA from 1983-1984. He 
received his Ph.D. in Biology from the University of California, Los Angeles. 

William M. Pfann joined Large Scale Biology in August 2000 as Senior Vice President Finance 
and Chief Financial Officer. Mr. Pfann was formeriy with PricewaterhouseCoopers LLP from 
1969 to July 2000, most recently as the Risk Management Partner for the Western Region. He 
served in a number of management roles at PwC, including leader of the firm's Silicon Valley 
audit practice. National Director of the networking and communications sector and Managing 
Partner of the Northern Califomia emerging business group, as well as Partner-in-Charge of 
the Oakland and Walnut Creek, California offices. Mr. Pfann received a B.S. degree from the 
University of California, Berkeley, in Business Administration and an MBA in Accounting from 
Golden Gate University. 

back to index 

© 2000 Large Scale Biology Corporation. All Rights Reserved Worldwide. 
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Leadership - Large Scale Proteomics Corporation 

N. Leigh Anderson, Ph D., Chairman, President and CEO of Large Scale Proteomics 
Corporation (LSP'"). Dr. Anderson obtained his B.A. in Physics with honors from Yale and a 
Ph.D. in Molecular Biology from Cambridge University (England) working with M. F. Perutz as 
a Churchill Fellow at the MRC Laboratory of Molecular Biology. Subsequently he co-founded 
the Molecular Anatomy Program at the Argonne National Laboratory (Chicago) where his 
work in the development of 2-dimensional electrophoresis (2-DE) and molecular database 
technology earned him, among other distinctions, the American Association for Clinical 
Chemistry's Young Investigator Award for 1982 and the 1983 Pittsburgh Analytical Chemistry 
Award. In 1985 Dr. Anderson co-founded LSP (originally Large Scale Biology Corp., 
Germantown, MD) in order to pursue commercial development and large-scale applications 
of 2-D electrophoretic protein mapping technology. 

Norman G. Anderson, P/i.D., Chief Scientist at LSP. Dr. Anderson has a distinguished record 
as an inventor. His career includes senior positions at Oak Ridge and Argonne National 
Laboratories (ORNL and ANL), more than 300 scientific publications, and the receipt of more 
than 20 prestigious awards in recognition of his work in science and technology. For his 
invention of the zonal ultracentrifuge, he received the John Scott Medal Award, and for the 
centrifugal fast analyzer, the Preis Biochemische Analytik fur Klinische Chemie from Die 
Deutsche Gesellschaft fur klinische Chemie for the most outstanding analytical development 
in clinical chemistry worldwide during a 2-year period. In 1984 ANL awarded him its career 
patent leader award for the largest number of patents issued to an employee. At that time the 
commercial value of his inventions in terms of U.S. sales and royalties from foreign licensing 
were $250 million and $1 million, respectively. Dr. Anderson received his degrees at Duke 
University: a B.A. in Zoology, M.A. in Physiology, and Ph.D. in Cell Physiology. He holds 28 
patents. 

Constance Seniff,V\ce President, Operations. Ms. Seniff has managed LSP's operations 
since 1993. Her background includes thirteen years in international business prior to joining 
LSP, five abroad in the employ of foreign firms. Ms. Seniff is responsible for helping 
formulate and implement business development and database commercialization strategies 
for LSP in coordination with the management of LSP's parent company, Large Scale Biology 
Corporation. Ms. Seniff has a B.Sc. degree in Business (with honors) from Florida State 
University. 

Robert J. Walden, Vice President, Finance at LSP. Mr. Walden joined LSP in 1997 and has 
served as a director since 1999. He previously served as Vice President of Finance and 
Administration at Osiris Therapeutics, Inc., and as Chief Financial Officer at the American 
Type Culture Collection (ATCC). Mr. Walden received his degree in Finance from the 
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